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ABSTRACT 


The Baryon Oscillation Spectroscopic Survey (BOSS) is designed to measure the scale of baryon 
acoustic oscillations (BAO) in the clustering of matter over a larger volume than the combined efforts 
of all previous spectroscopic surveys of large scale structure. BOSS uses 1.5 million luminous galaxies 
as faint as i = 19.9 over 10,000 deg? to measure BAO to redshifts z < 0.7. Observations of neutral 
hydrogen in the Lya forest in more than 150,000 quasar spectra (g < 22) will constrain BAO over the 
redshift range 2.15 < z < 3.5. Early results from BOSS include the first detection of the large-scale 
three-dimensional clustering of the Lya forest and a strong detection from the Data Release 9 data 
set of the BAO in the clustering of massive galaxies at an effective redshift z = 0.57. We project 
that BOSS will yield measurements of the angular diameter distance D4 to an accuracy of 1.0% at 
redshifts z = 0.3 and z = 0.57 and measurements of H(z) to 1.8% and 1.7% at the same redshifts. 
Forecasts for Lya forest constraints predict a measurement of an overall dilation factor that scales 
the highly degenerate D4(z) and H^ !(z) parameters to an accuracy of 1.9% at z ~ 2.5 when the 
survey is complete. Here, we provide an overview of the selection of spectroscopic targets, planning 
of observations, and analysis of data and data quality of BOSS. 
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1. INTRODUCTION 


The large scale structure of the universe, as traced 
by galaxies in redshift surveys and by intergalactic hy- 
drogen absorption towards distant quasars, has played 
a central role in establishing the modern cosmological 
model based on inflation, cold dark matter, and dark en- 
ergy. Key steps in this development have included the 
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1999), the Las Campanas Redshift Sur- 


vey (Shectman et al. , and the 'Two Degree Field 
Galaxy Redshift Survey (2dFGRS; [Colles et all [2001]. 
The largest and most powerful redshift surveys to date 
have been those from the Sloan Digital Sky Survey 
(SDSS; [York et al] [2000], which measured redshifts of 


nearly one million galaxies in spectroscopic observations 
between 2000 and 2008 (the phases known as SDSS-I 
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and -II, described in the seventh data release, DR7, by 
2009). The SDSS also obtained the most 


precise constraints to date on structure at 2 < z < 4 us- 
ing Lya forest absorption towards ~ 3000 high redshift 
quasars (McDonald et al]2003) 200d [Scliak et aL]2008). 
building on earlier studies that analyzed much smaller 
samples of higher resolution spectra (Croft et al. 

This paper describes the Baryon Oscillation Spectro- 
scopic Survey (BOSS), the largest of the four surveys 
that comprise SDSS-III (Sisenstem ct alI2O11). Shorter 
summaries of BOSS have appeared previously in an As- 
tro2010 white paper (Schlegel et a1]|2009), in the SDSS- 


III overview paper (Eisenstein et al]|2011), and in pa- 
pers presenting some of the survey's early cosmolog- 


ical results (e.g. 

Large-scale structure offers a novel tool to make precise 
measurements of cosmic distances via baryon acoustic 
oscillations (BAO), a feature imprinted on the cluster- 
ing of matter by acoustic waves that propagate in the 


pre-recombination universe. A long-standing prediction 
of cosmological models (Sakharov Peebles & Y: 
Sunyaev & Zel’dovich||1970), BAO rose to promi- 


nence in recent years as a tool to measure the ex- 
pansion history of the universe (Eisenstein et al 
"The 
first clear detections of BAO came in 2005 from analyses 
of the SDSS (Eisenstein et al and the 2dFGRS 
galaxy samples, and even these first 
discoveries set few-percent constraints on the cosmic dis- 
tance scale. With sufficient data, the BAO “standard 
ruler” can be used to separately measure the angular 
diameter distance D 4(z) from transverse clustering and 
the Hubble expansion rate H(z) from line-of-sight clus- 
tering. Da(z) and H(z) have not yet been measured 
separately at any redshift. A particular attraction of 
the BAO method is its low susceptibility to systematic 
errors, a feature highlighted in the report of the Dark En- 
ergy Task Force (DETF; 2006). The BAO 
method is reviewed in detail by|Weinberg et al. see 
their $4), including discussions of the underlying theory, 
the effects of non-linear evolution and galaxy bias, sur- 
vey design and statistical errors, control of systematics, 
recent observational results, and complementarity with 
other probes of cosmic acceleration. 

A number of recent cosmological surveys were designed 


with BAO measurement as a major goal. The recently 
completed WiggleZ survey ob- 
served 200,000 emission-line galaxies over 800 deg?, ob- 
taining the first BAO measurements at 0.5 « z « 1.0, 
with an aggregate distance precision of 3.8% at a central 
redshift z = 0.6 (Blake et al||2011b). The Six-Degree 
Field Galaxy Survey (6dFGS; was de- 


signed to map structure in the local universe; it produced 
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a 4.5% BAO distance measurement at redshift z ~ 0.1 
(Beutler et al] [2011]. The primary SDSS BAO results 
came from its redshift survey of luminous red galaxies 
(LRGs; , with color selection used 
to separate high luminosity targets from nearby galax- 
ies at similar apparent magnitudes. This survey, reach- 
ing a limiting apparent magnitude r — 19.5, provided 
a sparse sample of 106,000 strongly clustered galaxies 
over a large volume, well suited to measuring structure 
on BAO scales. The LRG survey achieved a roughly 
constant comoving space density of 1074 ?Mpc ? from 
z = 0.16 to z = 0.40, with a declining density out to 
z ~ 0.45. derived the power spec- 
trum from the LRG sample, the DR7 galaxy sample, 
and the 2dFGRS sample to obtain a 2.7% measurement 
of the BAO scale at z = 0.275 using a total of 893,319 
galaxies over 9100 deg?. used the 
full LRG sample from DR7 to measure the galaxy cor- 
relation function and obtain a 3.5% measurement of the 
BAO distance scale at z = 0.35. 

The BAO measurements described above suffer from 
a degradation in the detection significance due to large- 
scale flows and non-linear evolution in the density field. 
As matter diffuses due to non-linear evolution and galaxy 
peculiar velocities, the BAO peak becomes broader and 
more difficult to constrain, particularly at low redshift. 
These effects can partly be reversed by “reconstruc- 
tion”, a technique by which the observed galaxy field 
is used to estimate the large-scale gravitational poten- 
tial (Eisenstein et al||2007). Using the Zel'dovich ap- 
proximation (Zel'dovichl|1970), the diffusion of galax- 
ies can then be measured and the density fluctua- 
tions can be shifted back to their Lagrangian posi- 
tions, thereby restoring the acoustic peak in both real 
and redshift space. Redshift-space distortions 


are removed in the same manner. The approach 


was refined, tested on simulated data, and first ap- 
plied to survey data using the full SDSS LRG sample 
(dust aC). Thee of the scour. 
tic peak was measured from the reconstructed data to 
an accuracy of 1.9% (Xu et al][2012), an improvement 
of almost a factor of two over the 3.596 measurement 
before reconstruction. Combining this distance mea- 
surement with the WMAP7 cosmic microwave back- 
ground (CMB) anisotropies leads 
to an estimate of Hg = 69.8 + 1.2 km s~!Mpc7! and 
Om = 0.280 + 0.014 for a flat Universe with a cosmo- 
logical constant (w = —1). The cosmological constraints 
are additionally improved when the reconstructed SDSS 
data are combined with Type Ia supernovae (SN Ia) 
from the SN Legacy Survey (SNLS; puc b 
measurements of Ho from the Hubble Space Telescope 
2011), and BAO constraints from the 6dF 
Galaxy Survey (Beutler et al||2011) and the WiggleZ 
Dark Energy Survey (Blake et al. . 
use these measurements to constrain w = —1.03+ 
0.08 for a cosmological model in which the equation of 
state for dark energy is allowed to vary. 

As the name suggests, BOSS is a survey designed to 
measure the universe using BAO. BOSS uses a rebuilt 
spectrograph from the original SDSS survey with smaller 
fibers, new improved detectors, higher throughput, and 
a wider wavelength range, enabling a spectroscopic sur- 


vey to higher redshift and roughly one magnitude deeper 
than SDSS. We hereafter refer to the upgraded instru- 
ments as the BOSS spectrographs. The BOSS spec- 
trographs and their SDSS predecessors are mounted to 
the telescope and are described in detail by [Smee et al. 
(2012). In brief, there are two double spectrographs, 
each covering the wavelength range 361 nm — 1014 nm 
with resolving power A/AA ranging from 1300 at the 
blue end to 2600 at the red end. Both spectrographs 
have a red channel with a 4k x 4k, 15 wm pixel CCD 
from Lawrence Berkeley National 
Laboratory (LBNL). We denote these channels R1 and 
R2 for the first and second spectrographs, respectively. 
Similarly, both spectrographs have blue channel with a 
4k x 4k, 15 um pixel CCD from e2v denoted B1 and 
B2. The instrument is fed by 1000 optical fibers (500 
per spectrograph), each subtending 2" on the sky. (The 
original spectrographs used 640 3" fibers.) When the sur- 
vey is complete, fibers will have been plugged into more 
than 2000 unique spectroscopic plates that each cover a 
circular field of view with 3? diameter. Aluminum cast 
cartridges support the optical fibers, spectrograph slith- 
ead, and spectroscopic plug plate. 

BOSS consists primarily of two interleaved spectro- 
scopic surveys observed simultaneously: a redshift survey 
of 1.5 million luminous galaxies extending to z — 0.7, and 
a survey of the Lya forest towards 150,000 quasars in the 
redshift range 2.15 € z < 3.5. BOSS uses the same wide 
field, dedicated telescope as was employed by SDSS-I and 


-II, the 2.5 m-aperture Sloan Foundation Telescope at 
Apache Point Observatory in New Mexico (Gunn et al. 


[2009]. Those surveys imaged more than 10000 deg? of 
high latitude sky in the ugriz bands 
(1993). using a mosaic CCD camera with a field of view 
spanning 2.5? (Gunn et al][1998). As discussed in q2.1] 
SDSS-III completed roughly 2500 deg? of new imaging 
to enlarge the footprint available to BOSS. All of the 
imaging, including that from SDSS-I and -II, is available 
in SDSS Data Release 8 (DR8; [Aihara et al]D011a), the 
first data release from SDSS-III. The forthcoming Data 
Release 9 will present the first public 
release of BOSS spectroscopic data, containing observa- 
tions completed prior to August 2011. 

The defining goal of the BOSS galaxy survey is to pro- 
duce BAO measurements limited mainly by sample vari- 
ance (as opposed to galaxy shot noise) over the volume 
available to the Sloan 2.5m telescope, a volume defined 
by the area of accessible high latitude sky and by the 
maximum practical redshift depth. The BOSS survey ob- 
serves primarily at 6 > —3.75° in the SDSS high-Galactic 
latitude footprint. The spectroscopic footprint is approx- 
imately 10,000 deg”; larger areas begin to stray into re- 
gions of high Galactic extinction. Given the configura- 
tion of the BOSS experiment, we can exceed the density 
of objects in the LRG sample and observe a comoving 
space density ñ = 2 — 3 x 1074 A?Mpc ? with strongly 
clustered galaxies (bias factor b ~ 2). In one-hour ex- 
posures under good conditions, the BOSS spectrographs 
can measure redshifts of these luminous galaxies with 
high completeness to i = 19.9, which for the desired space 
density puts the outer redshift limit at z — 0.7. BOSS is 
designed to observe 1.5 million galaxies to these limits, 
including 150,000 galaxies that satisfy the BOSS selec- 


tion criteria but were previously observed during SDSS- 
I and -II. For BAO and other large scale power spec- 
trum measurements, the comoving volume of the com- 
pleted BOSS galaxy survey, weighted by the redshift- 
dependent number density and galaxy power spectrum, 
will be 6.3 times that of the SDSS-I and -II LRG sample. 
The higher density of BOSS galaxies relative to SDSS-I 
and -II LRGs also improves the performance of the re- 
construction methods discussed above that correct for 
non-linear degradation of the BAO signal. 

The BOSS quasar survey is pioneering a novel method 
of measuring BAO at high redshift (2.15 < z < 3.5) using 
Lya forest absorption towards a dense grid of background 
quasars. The redshifted Lya line becomes detectable in 
the BOSS spectral range just beyond z = 2 and becomes 
highly opaque at z > 4, motivating the targeted redshift 
range. Even at moderate resolution, transmitted flux in 
the forest provides a measure of hydrogen along the line 
of sight that can be used to infer clustering of the under- 
lying dark matter distribution 
[McDonald D003). Prior to BOSS, cosmological measure- 
ments of the forest treated each quasar sightline in isola- 
tion, since samples were too small and too sparse to mea- 
sure correlations across sightlines except for a few cases 
of close pairs or other tight configurations. 
(2003); suggested that three-dimensional 
Lya forest correlations in a large quasar survey could be 
sed io messire BAO. 
developed this idea in detail, constructing a Fisher ma- 
trix formalism that helped guide the design of the BOSS 
quasar survey. used the first year of 
BOSS data to obtain the first three dimensional mea- 
surement of large scale structure in the forest, detecting 
transverse flux correlations to comoving separations of at 
least 60^! Mpc. The goal of the BOSS quasar survey is 
to obtain spectra of at least 150,000 Lya quasars selected 
from about 400,000 targets. Extrapolations based on 
data taken to date suggest that BOSS will ultimately ob- 
serve about 175,000 Lya quasars over 10,000 deg?. Ultra- 
violet, near-infrared, and multiple-epoch optical imaging 
data increased the selection efficiency beyond the mini- 
mum goal. 

This paper is one of a series of technical papers de- 
scribing the BOSS survey. The BOSS imaging survey 
data are described Aaaa Gord. while BOSS 
is described in the context of SDSS-III in 
(2011). The selection of galaxy and quasar targets are 
described in[Padmanabhan et al] (20121) and[Ross et al 
(20128), respectively, the spectrograph design and per- 
formance in (2012). and the spectral data 
‘eduction d (2013) and 
(2012). Here, we describe the details of the BOSS sur- 
vey itself, with an emphasis on survey strategy and op- 
erations. In Q2] we present the BOSS footprint and the 
selection of the galaxies and quasars that will be used to 
measure the BAO feature. In we review the process 
by which we design the spectroscopic plates to observe 
those targets. In §4] we describe the procedures for ob- 
servation, the real-time data quality assessment, and the 
data processing pipeline. Much of the information found 
in Q3] and $4] was never described for SDSS-I or -II and 
remains unchanged from those earlier surveys. We dis- 
cuss the redshift efficiency and the strategy to complete 


observations of the full 10,000 deg? in d5] and provide 
examples of the spectral data quality in $6] Finally, we 
compare the BOSS data to SDSS spectra, highlight some 
recent BOSS science results, and provide forecasts for 
cosmological constraints in {7] 


2. SPECTROSCOPIC TARGETS 


The SDSS-I and -II imaging programs provide two 
large, contiguous regions of sky that lie away from the 
Galactic plane. Additional imaging in the Falls of 2008 
and 2009 extended this footprint and increased the sky 
volume observable with BOSS. BOSS is designed to sur- 
vey the full SDSS imaging footprint with dense spectro- 
scopic coverage over five years. The first goal of BOSS 
is a redshift survey of ~ 1.5 million luminous galaxies 
at 0.15 < z < 0.7, at a surface density of 150 deg~?. 
This sample is divided into two subsamples. The first is 
a low redshift sample at 0.15 < z < 0.43, with a median 


redshift of z = 0.3 and a surface density of 30 deg~?. 


This sample is a simple extension of the SDSS-I and -II 
TRG sani Pepe: Wea 
ties, where the brightest ~ 10 deg~? have already been 
observed by SDSS. A higher redshift sample expands the 
2dF-SDSS LRG and QSO Survey (2SLAQ: [Cannon et ai] 
galaxy selections and covers 0.43 « z « 0.7, with 
a surface density of 120 deg"? and a median redshift of 
z — 0.57. The second goal of BOSS is to survey at least 
150,000 quasar spectra at 2.15 < z « 3.5 to probe the 
intergalactic medium along each sightline through Lya 
absorption. Approximately 596 of the BOSS fibers are al- 
located to obtain spectra of targets that would otherwise 
not be included in the BOSS target selection. Dubbed 
ancillary science targets, these spectra provide the data 
for smaller research programs proposed by members of 
the collaboration. These programs are described in Ap- 


pendix 


2.1. Imaging Data 


In its first five years of operation (2000—2005), SDSS 
obtained 5-band imaging over 7600 deg? of the high 
Galactic latitude sky in the Northern Galactic hemi- 
sphere and 600 deg? along four disjoint stripes in the 
Southern hemisphere. During the falls of 2008 and 2009, 
the same camera was used to grow 
the Southern imaging to a contiguous 3100 deg? foot- 
print. As with the original SDSS imaging survey, we 
obtained CCD imaging in scans with an effective ex- 
posure time of 55 seconds in each of five filters (ugriz; 
[Fukugita et al][1998). Imaging was performed when the 
moon was below the horizon, under photometric condi- 
tions, and seeing was better than 2" in the r filter. The 
prioritization was to image north of the celestial equator 
in September and October, and south of the equator in 
November and December. This prioritization makes the 
imaging depth more uniform by roughly canceling two 
effects: the southern declinations are looking through 
more atmosphere, but we did so in the winter months 
when the atmosphere is typically more transparent. The 
seeing requirements were somewhat relaxed from SDSS 
by accepting conditions to 2", although the minimum 
scan time was increased from 0.5 to 1 hour to ensure 
sufficient astrometric and photometric calibration stars. 
The typical 50% completeness limit for detection of point 


sources is r — 22.5. 

The final SDSS imaging data set, including the new 
BOSS imaging, was released as part of DR8. These data 
were reduced with a uniform version of the photomet- 


ric pipeline (Lupton et al.| (2001 


with improved sky-subtraction described in the DR8 re- 
lease (Aihara et all2011a). The photometric calibra- 
tion (Smith et al. 


2006) has been improved with a global re-calibration 
(Padmanabhan et al][2008). The astrometric 
calibration was tied to the UCAC2 system for de- 
clinations below approximately 41° and the USNO-B 
system at higher declinations. This astrometry is less 
accurate than the earlier DR7 release or the later re- 
calibration of the entire sky in the DR9 release. The 
BOSS coordinates for all target classes corresponds to 
the DR8 astrometric system which is internally consis- 
tent and only introduces small relative offsets on the 3° 
scale of a plate. These coordinates are offset 240 mas to 
the North and 50 mas to the West relative to DR9 coor- 
dinates at higher declinations, with additional sources of 
error introducing scatter of ~ 50 mas (Ahn et_al][2012). 

These imaging data define the baseline goal for the 
BOSS spectroscopic survey. We used a trimmed subset 
called the “BOSS footprint”, which is two contiguous re- 
gions of low extinction centered in the north Galactic 
cap (NGC) and south Galactic cap (SGC). To avoid re- 
gions of high extinction, the footprint lies above +25° 
Galactic latitude for all regions except around a Right 
Ascension of 120° (b > 15°) and Right Ascension of 330° 
(b > 20°). Several 0.2°-wide strips were also excised from 
this overall footprint for the quasar targets only to reject 
imaging data when u-band amplifiers were not function- 
ing, as explained in DR2 (Abazajian et al][2004). The 
BOSS footprint is presented in Figure [i] 
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Fic. 1.— Location of pointing centers for the 2208 spectroscopic 
plates in the BOSS survey footprint in an Aitoff projection in 
J2000 equatorial coordinates. Gray circles represent the location 
of plates that remained to be drilled after Summer 2011. Blue 
circles represent plates that were drilled and were ready to observe 
in the third year of the survey. Red circles represent the plates 
that were completed in either the first or second year of the 
survey, whose observations are released in DR9. 


In total, galaxy spectroscopic targets were chosen from 
3172 deg? in the SGC and 7606 deg? in the NGC. A to- 
tal of 7578 deg? in the NGC were used for the quasar 
target selection due to the loss of u—band imaging in 
certain regions. Because this footprint is too large for 
the BOSS survey, it was was further trimmed by remov- 


TABLE 1 
SuRVEY PARAMETERS 
Region Contiguous Imaging Area for Galaxy Area for Quasar Area, after Number 
Area ( deg?) Targets (deg?) Targets (deg?) Tiling (deg?) of Plates 
SGC 3172 2663 2663 2634 633 
NGC 7606 7606 7578 7426 1575 


ing most of the area south of the celestial equator, re- 
ducing the SGC footprint to 2663 deg”. The coverage 
of the SGC is presented in Figure [I] as the region with 
a > 300? or a < 60°. The coverage of the NGC is in- 
cluded in the region with 60° < a < 300°. The spur 
in Figure [1] located at approximately 30° < a < 40° 
and 6 < —2? was chosen to overlap with the W1 region 
of the Canada-France-Hawaii Telescope Legacy Survey 
(CFHTLS; [Goranova et_al]|2009). Using ugriz photom- 
etry from these 10,269 deg?, we choose galaxy, quasar, 
and ancillary science targets for spectroscopic follow-up. 
A summary of the area covered in imaging and spec- 
troscopy is provided in Table [I] 


2.2. Imaging Pre-selection 


We briefly describe those photometric parameters mea- 
sured for each object in the imaging data that are rele- 
vant to target selection. More detailed descriptions are 
available in the online SDSS documentation and the data 
release papers, especially EDR 
and DR2 (Abazajian et al]2004). 


e PSF magnitudes are designed for point sources 
and are a fit of the point spread function (PSF) 
model evaluated at the location of each object. We 
refer to these magnitudes measured in any filter 
“X” as XpsF. 


e Fiber2 magnitudes are designed to simulate the 
flux captured by the BOSS fibers for either point 
sources or extended sources. The images are con- 
volved with Gaussians to simulate 2" seeing, and 
the flux is measured within a 2" circular aperture 
representing the size of the BOSS fibers. Unlike the 
other magnitudes that are measured on individual 
objects deblended from their neighbors, the fiber 
magnitudes include flux from those neighbors. A 
caveat of these magnitudes is for the small amount 
of SDSS imaging data taken in seeing worse than 2" 
seeing, the fiber magnitudes are measured without 
convolution or deconvolution of those images. In 
those cases, the fiber magnitudes are fainter than 
they should be. Throughout the text we refer to 
these magnitudes measured in any filter ^X" as 
Xgp2. These magnitudes are analogous to the 3” 
fiber magnitudes from SDSS which simulated the 
flux captured by the original SDSS 3" fibers. 


e Model magnitudes are designed to measure 
galaxy colors that are unbiased relative to the 
signal-to-noise ratio (SNR) of the image or unmod- 
eled substructure in the galaxies. A library of de 
Vaucouleurs (de Vaucouleurs and exponen- 
tial (Freemanl|1970) profiles with varying radii, el- 
lipticities and position angles are fit to each ob- 
ject's image. The best-fit profile from the r band, 


convolved with the PSF in each band, is used as a 
matched aperture for the model magnitudes. 


e Cmodel magnitudes (composite model) are de- 
signed to produce a good estimate of the "total" 
flux of galaxies. These magnitudes are based upon 
the best-fitting positive linear combination of ex- 
ponential and de Vaucouleurs profiles, convolved 
with the PSF. These are measured independently 
in each band. 


€ raev, is the best fit de Vaucouleurs effective radius 
in the 4 band (in units of arcseconds). 


Every object in the SDSS imaging catalogs has a se- 
ries of flags indicating the nature and quality of the data, 
allowing one to define a single unique measurement of ob- 
jects in regions observed more than once, and to identify 
any problems with processing and measuring the proper- 
ties of each object. We used these flags to remove ques- 
tionable objects from consideration for spectroscopy. 

Repeat observations and overlapping scans resulted in 
multiple observations for some objects; only unique ob- 
servations identified by the SURVEY PRIMA RY bit 
of the RESOLVE. STATUS flag assigned to each ob- 
ject were used for the main targeting algorithms, as de- 
bd (DO). Multiple imaging ob- 
servations were used for some special programs, such as 
quasars targeted based on their variability. 

For the galaxy, quasar, and standard star targets se- 
lected for the primary BAO program (denoted “main 
targets" in which follows), we rejected objects for which 
flags set during photometric pipeline processing indi- 
cated that their photometry was unreliable. All objects 
with the SATUR flag set (which indicates that the ob- 
ject includes saturated pixels) are dropped from consid- 
eration for galaxies, quasars, and standard stars. 

The quasar target selection algorithm is most sensitive 
to objects with unreliable photometry (stars are far more 
numerous, leading to a very large contamination rate), 


and thus it does the most careful checking. The flags used 
are described in detail in 
pendix A of[Ross et al] (2012b) (see also 
Boo), and include flags designed to indicate problems 
due to deblending, interpolation over cosmic rays and 
bad pixels and columns, and proper motion between ex- 
posures in different filters (due to asteroids). 

For galaxy target selection, we also use a special- 
ized set of set of flag cuts to ensure good target selec- 


tion. The detailed galaxy flags for galaxies are given in 
Guia). 

Objects flagged in the following ways were not consid- 
ered for followup as standard stars. These flag cuts are in 


addition to SURVEY. PRIMARY and SATUR listed 
above. 
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e Objects observed in non-photometric conditions 
(as indicated by the CALIB. STATUS flag) in 
any band. 


Objects labeled INTERP. CENTER, 
PSF.FLUX INTERP, indicating that interpo- 
lation over cosmic rays or bad pixels significantly 
affected the measured flux of the object. 


with de- 


Objects flagged in association 
blending of overlapping objects. Objects 
labeled as BLENDED; such objects are 
the parents of objects subsequently de- 
blended into children. Objects flagged DE- 
BLEND.TOO.MANY.PEAKS; this flag 
indicates that there were more objects to deblend 
in this family than the deblender could handle. 
Objects marked PEAKS. TOO.CLOSE, indi- 
cating the peak for this object was too close to 
another peak. 


Objects flagged CR, meaning the object contained 
a cosmic ray that was interpolated over. 


Objects flagged with BADSKY, meaning the es- 
timate of the background sky at this position is 
suspect. 


Objects labeled NOTCHECKED. CENTER, 
meaning the center of the object lies in a region 
not checked for peaks; often due to bad deblend- 
ing. 


Objects flagged as 'STATIONARY, indicating 
that the object is an asteroid. 


Further details of these flags are found in 
(2003) (2006), and 
the SDSS-III website. 

As described in (2012b), we 
tracked regions of rejected objects and bad photome- 
try to define the exact area where spectroscopy was per- 
formed. It is recommended that users apply these masks 
when performing statistical studies of large-scale struc- 
ture with the galaxy sample or investigations that require 
complete understanding of the selection function, such as 
a measurement of the quasar luminosity function. Since 
the Lya quasars provide random sightlines, it is unlikely 
that the masks will affect studies of large scale structure 
in the Lya forest. 


2.3. Galaxy Target Selection 
'The dominant cosmological volume in the SDSS-I and 


-II surveys was mapped by the Luminous Red Galaxy 
sample and the BOSS survey uses 
a similar philosophy of color-magnitude and color-color 
plots to select the galaxy sample. Unlike the LRG sam- 
ple, the higher redshift BOSS (“CMASS”, see below) 
sample is not restricted to a sample of red galaxies, but 
instead attempts to select a stellar mass-limited sample 
of objects of all intrinsic colors, with a color cut which 
selects almost exclusively on redshift. We lower the lumi- 
nosity cut relative to the LRG sample and allow a wider 
color range to achieve a higher density and also provide a 
less biased sample for studies of massive galaxy evolution. 


We therefore do not use the designation ^LRG" for these 
galaxies, even though their selection criteria are similar 
in spirit to previous LRG surveys 


The BOSS galaxies are selected to have approxi- 
mately uniform comoving number density of n — 3 x 
1074 h? Mpc ? out to a redshift z = 0.6, before monoton- 
ically decreasing to zero density at z ~ 0.8. Galaxy shot 
noise and sample variance make roughly equal contribu- 
tions to BAO errors when riPgAo = 1, where PpAo is the 
galaxy power spectrum at the BAO scale, approximately 
k = 0.2hMpc !. For the strongly clustered galaxies ob- 
served by BOSS, ñ = 3 x 1074 h?Mpc ? yields roughly 
nPgao = 2, making shot-noise clearly sub-dominant. In- 
creasing to nPgao = 2 should also improve the perfor- 
mance of the reconstruction approach described in the 
introduction. Generating a higher space density would 
require substantially more observing time with only mod- 
erate gain in clustering signal-to-noise. We therefore ex- 
pect significant improvements in BAO constraints over 
the SDSS-I and -II LRG sample (ñ = 1x 10-4 A?Mpc ?), 
even in the redshift range where the two samples overlap. 

The galaxy target selection algorithms will be doc- 
umented in detail in (20120); we 
summarize the principal points below. We select these 
targets from SDSS imaging according to the photomet- 
ric parameters from the processing pipeline. All mag- 
nitudes are corrected for Galactic extinction using the 
models of dust absorption. Target 
galaxies are chosen based on color and apparent mag- 
nitude cuts motivated by the stellar population models 
of [Maraston et al] (2009). These cuts are based on the 
expected track of a passively evolving, constant stellar 
mass galaxy as a function of redshift. 

Targets are selected based on the following set of model 
magnitude colors 
: 


cy, =0.7(g — r) + 1.2(r — i — 0.18) (1) 
c, —(r— i) — (g — r)/4 — 0.18 (2) 
dj; =(r — i) — (g — r)/8.0 . (3) 


These color combinations are designed to lie parallel or 
perpendicular to the locus of a passively evolving popu- 
lation of galaxies, with c, and cj being the appropriate 
colors to select galaxies below z ~ 0.4 and d, at higher 
redshift. To a good approximation, the perpendicular 
colors track the location of the 4000 À break, which is 
redshifted from g—band to r—band at z ^ 0.4, and pro- 
vide an initial redshift selection. 

Galaxy target selection mirrors the split at z ~ 0.4 and 
selects two principal samples: “LOWZ” and *CMASS". 
The LOWZ cut targets the redshift interval 0.15 < z « 
0.43. 'This sample includes color-selected galaxies with 
16 < r < 19.5, r < 13.6 + c,/0.3, and |c,| < 0.2 where 
r is the cmodel magnitude. Over most of the BOSS 
footprint, about one third of the LOWZ galaxies already 
have spectra from SDSS-I and -II; these objects are not 
re-observed. However, in the SGC, the SDSS-I and -II 
spectroscopy was limited primarily to low declination, re- 
sulting in a larger fraction of LOWZ targets in the South. 
Cut II (CMASS for “constant mass") includes color- 
selected galaxies in the magnitude range 17.5 < i < 19.9, 


d, > 0.55, and i < 19.86 + 1.6 x (d, — 0.8) where i is 
the cmodel magnitude. The CMASS sample is designed 
to select galaxies at 0.43 < z « 0.7, although it extends 
beyond these nominal redshift boundaries. In addition, 
to better understand the completeness of the CMASS 
galaxies and the impact of the color-magnitude cut, we 
also target CMASS.SPARSE galaxies. These galaxies 
share the same selection as the CMASS galaxies, ex- 
cept that the color-magnitude cut has been shifted to 
i < 20.144 1.6 x (dj, — 0.8). We target these galaxies at 
a rate of 5 deg ?. 

In a manner similar to that described in 
(2002), we track the source of the 
selection for galaxy objects and all spectroscopic 
targets with flag bits encoded in the new quantities 
BOSS.TARGETI1 and BOSS_TARGET2. Targets 
that satisfy the LOWZ criteria are denoted GAL. LOZ, 
while CMASS targets fall into several categories labeled 
with a GAL.CMASS. prefix. A description of all flag 
bits for the main targets is found in Appendix [A] 

present photometric stellar 
masses for a large sample of BOSS galaxies by fitting 
model spectral energy distributions to ugriz photom- 
etry. They demonstrate that the main BOSS galax- 
ies have a narrow mass distribution which peaks at 
log(.M/ M) ~ 11.3. The results confirm that the target 
selection has successfully produced a sample of galaxies 
with fairly uniform mass from 0.2 « z « 0.6. 


2.4. Quasar Target Selection 


The primary goal of the BOSS quasar survey is to map 
the large scale distribution of neutral hydrogen via ab- 
sorption in the Lyo forest. Measurements of BAO in the 
three-dimensional correlation function in this intergalac- 
tic neutral hydrogen will provide the first direct mea- 
surements of angular diameter distance and the Hubble 
parameter at redshifts z > 2. However, for the Lya for- 
est to adequately sample the three-dimensional density 
field for à BAO measurement, the BOSS quasar density 


must be an order of magnitude larger than SDSS (for 
the density of SDSS quasars, see |Schneider et al]|2010)) 
over the same redshift ra McDonald & Eisensteir 


nge ( 

IMcOuinn & Whitd D01 1). The sample must also 
provide targets with a luminosity distribution that en- 
ables adequate signal-to-noise ratio spectroscopy over the 
forest. At a minimum, 15 quasars deg ? at redshifts 
2.15 < z < 3.5 and gpsp < 22.0 are required to make this 
measurement. Since the precision of the BAO measure- 
ment shows an approximately linear improvement with 
the surface density of quasars at fixed spectroscopic SNR, 
we attempt to obtain as many quasar sightlines as possi- 
ble. Fortunately, because the quasars are nearly uncor- 
related with the intervening density field, Lya measure- 
ments are insensitive to the exact details of quasar target 
selection, and do not require the uniform sample that is 
essential to the galaxy BAO measurement. 

Extrapolating from the quasar lu- 
minosity function, we find that quasar targets must be 
selected to a magnitude limit of gpsr < 22.0 to ob- 
tain a surface density of 15 high-z quasars deg ?. How- 
ever, identifying quasar targets from photometric data 
is complicated by photometric errors and the similar- 
ity of quasar colors (particularly at z ~2.7) to colors 
of A stars and blue horizontal branch stars (e.g. 


Richards et al||2002 . This 


suggests that a more sophisticated method for target 


selection is required than that used in SDSS-I and -II 
(Richards et a1/|2002). 


In the first year of BOSS observations, we compared 
and tested a variety of methods to optimize the effi- 
ciency of quasar target selection. ‘These methods in- 
cluded: a “Kernel Density Estimation” (Richards et al. 
which measures the densities of quasars and stars 
in color-color space from training sets and uses these to 
select high probability targets; a *Likelihood" approach 
which determines the likelihood that each object is a 
quasar, given its photometry and models for the stel- 


lar and quasar loci (Kirkpatrick et alj|2011); an ^Ex- 
treme Deconvolution" (XDQSO; 2011allb) se- 


lection, which performs a density estimation of stars and 
quasars by incorporating photometric uncertainties; and 
an artificial neural network (Yéche et al] 2010), which 
takes as input the SDSS photometry and errors from a 
training set in order to run a classification scheme (star 
vs. quasar) and generate a photometric redshift esti- 
mate. XDQSO includes data from the UKIRT Infrared 
Deep Sky Survey (UKIDSS; 2007) and 
the Galaxy Evolution Explorer (GALEX; 
when available, showing the greatest improvement 
in selection efficiency with the inclusion of the UKIDSS 
data. 

As with galaxies, quasars targeted by BOSS are 
tracked with the BOSS_TARGET1 flag bits: details of 
each selection and flag are found up (20121). 
Quasar target selection falls into five distinct categories: 


e QSO. CORE.MAIN, hereafter CORE, includes 
targets selected by a uniform method that can eas- 
ily be reproduced for studies of the luminosity func- 
tion, clustering, and other quasar science. The 
XDQSO method was selected as CORE at the be- 
ginning of the second year of BOSS observations 
and applies to spectroscopic plates numbered 4191- 
4511, and 4530 and above. 


e QSO_BONUS_MAIN, hereafter BONUS, in- 
cludes targets selected in a non-uniform way to uti- 
lize the full complement of ancillary imaging data 
and maximize the spectroscopic quasar density. As 
mentioned above, heterogeneous selection does not 
bias cosmology constraints because the BAO mea- 
surement in the Lya forest is not dependent on the 
properties of the background illuminating quasar. 


e QSO KNOWN.MIDZ includes previously 
known z > 2.15 quasars, the majority of which 
are from SDSS. Given the higher throughput of 
the BOSS spectrographs, these new observations 
from BOSS provide much deeper spectra in the 
Lya forest region, and comparison to SDSS allows 
calibration tests and studies of spectroscopic 
variability. 

e QSO.FIRST.BOSS includes objects from the 
FIRST sadip survey wil ook 


ors consistent with quasars at z > 2.15. 


e QSO KNOWN.SUPPZ includes a subset of 
previously known quasars (mostly from SDSS-I and 


-II) with 1.8 < z < 2.15. Previous studies of the 
Lya power spectrum (e.g. 
indicate that metal absorption contributes a small 
amount of spurious clustering power in the Lya for- 
est. This lower redshift sample, re-observed with 
BOSS, allows a measurement of the spectral struc- 
ture from metal lines that appear in the Lya forest 
at the low redshift range of the quasar sample, al- 
lowing the excess power to be modeled and removed 
from Lya clustering measurements. 


In addition, in the limited regions where imaging 
stripes overlap in the SDSS imaging survey, quasar tar- 
gets were selected based on photometric variability, with 
color selection to choose objects at z > 2.15. These 
regions cover approximately 3096 of the BOSS foot- 
print and are identified by the eighth bit in the AN- 
CILLARY TARGET?2 flag developed for BOSS ancil- 
lary programs. This bit is referenced by name as the 
QSO.VAR.SDSS target flag and is found in plates 
numbered higher than 4953. The density of targets 
added by this method varies significantly from one re- 
gion to another, and is on average ~ 4 deg ?. These 
targets lead to an increase in the density of spectroscopi- 
cally confirmed quasars at z > 2.15 of two to three deg? 
in the overlap regions. 

The CORE, BONUS, FIRST and KNOWN sam- 
ples account for an average of 20, 18.5, 1, and 1.5 targets 
deg" ?, respectively. 40 objects deg? are targeted as can- 
didate quasars across the full survey (see Figure 8 and 
Figure 9 in[Ross et_al./2012b), although quasars were tar- 
geted at a somewhat higher density early in the survey. 
The QSO. KNOWN.SUPPZ and QSO. VAR. SDSS 
samples are used to fill unused fibers at a very low pri- 
ority, resulting in a greatly varying target density. Ob- 
servations in the second year included a near-final tar- 
get selection for quasars and produce average densities 
that range from 15 to 18 quasars deg? at z > 2.15, de- 
pending on the amount of ancillary imaging data (e.g., 
UKIDSS) and the density of stars. The selection of these 
targets and the target bits for the various quasar selec- 
tion schemes is explained in detail tales ct all (2012). 

Multi-epoch imaging data in Stripe 82 (—43? < 
a < 45?,—1.25? < ð < 1.25?) was used to determine 
an effective upper limit of quasar detection efficiency. 
By using photometric variability to select quasars, we 
found 24 quasars deg ? at z > 2.15 with gpsp < 22 
across the 220 deg? 
region. This analysis recovered quasars in the 2.5 « 
z « 3.5 redshift range that were systematically missed 
by color-selection techniques because of proximity to the 
stellar locus and larger photometric errors in single-epoch 
imaging. 


2.5. Ancillary Science Targets 


As in the original SDSS spectroscopy survey, several 
special observing programs were designed to pursue sci- 
ence goals not covered by the primary galaxy and quasar 
targets. A total of 206 special plates were dedicated to 
these programs in SDSS (as described in the DR4 pa- 
per;[Adelman-McCarthy et_al./2006). These SDSS plates 
were primarily along Stripe 82 because of the additional 
science goals enabled by multiple imaging epochs and 
because of the amount of available spectroscopic ob- 


serving time in the Fall months when that region was 
visible. In BOSS we dedicate roughly 5% of fibers on 
each plate to a new series of ancillary programs. We 
track the selection of each object with flag bits encoded 
in the quantities ANCILLARY_TARGETI and AN- 
CILLARY TARGET2. 

These ancillary programs are intended to support stud- 
ies that require sizable samples over large regions of sky, 
making them difficult to complete in conventional obser- 
vations at shared facilities. However, because these pro- 
grams are given a lower priority for target selection than 
the primary science drivers of BOSS, the sample selection 
is often not complete. These programs fall into two cate- 
gories: those using the repeat and deep imaging of Stripe 
82 and those selected from the rest of the BOSS imag- 
ing area. Spectroscopic observations are now complete 
for targets in Stripe 82 and are included in DR9. Obser- 
vations of the remaining programs will continue through 
the end of the survey and will be included in part in DR9 
and in future data releases. A description of each of the 
ancilary programs, including their scientific rationale, 
approximate density, and algorithm for target selection 
is found in Appendix Additional ancillary programs 
may appear in future BOSS observations; these will be 
documented in the corresponding SDSS Data Release pa- 
pers and in the science papers that exploit them. 


3. DESIGNING THE OBSERVATIONS 


As with the previous SDSS surveys, the BOSS spectra 
are obtained through observations of 1.5? radius spectro- 
scopic plates. The SDSS-I and -II plates accommodated 
640 fibers (180m diameter) that projected to a 3" di- 
ameter on the sky to match the profile of the main SDSS 
galaxies (Zmean ~ 0.15). Each BOSS plate contains 1000 
fibers with a smaller (1204m) 2" diameter to reduce sky 
background and match the smaller angular profile of the 
higher redshift CMASS galaxies (zmean ~ 0.57). The 
minimum allowable distance between fibers was 55" in 
SDSS and is 62" in BOSS, set by the cladding around 
each fiber. One might worry that pointing errors and see- 
ing losses would overcome the effects of reduced sky noise 
for 2" fibers, degrading the SNR relative to 3" fibers. 
We have conducted tests that show that we are in fact 
gaining on an object-by-object basis, the spectrograph 
throughput is significantly improved, and the increase in 
fiber number from 640 to 1000 (which fit in the same 
slithead design as SDSS) is a large efficiency gain. 

Science targets are assigned to these plates in a process 
referred as “tiling” (Blanton et al! . In SDSS-I and 
-II, imaging and spectroscopy were interleaved, requiring 
the tiling to progress in pieces as the imaging for each 
area of the sky sky was completed. As explained in I] 
the imaging footprint for BOSS was completed in Fall 
2009, making it possible to tile the entire 10,269 deg? 
footprint before full spectroscopic observations began. 
Changes in the target selection algorithms required occa- 
sional retilings of the survey footprint; we track changes 
in target selection as described below. After tiling, the 
plug plates are designed according to the estimated time 
(and thus airmass) of the observation. Target positions 
are converted from equatorial coordinates to (x,y) plate 
coordinates for drilling with a Computer Numerically 
Controlled milling machine at the University of Washing- 
ton. The application of tiling and plate design to spec- 
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TABLE 2 
FIBER ASSIGNMENT PRIORITY FOR RESOLVING FIBER COLLISIONS? 


Chunks Plates 
boss1-boss2 3536-3750, 4526-4529 
boss3-boss9 3751-4097 


boss10-boss13 4174-4428, 4530-4556, 
4656-4669 

boss14-boss20 4440-4525, 4557-4655, 
4670-5140 


Target Bit Priority Order 


quasar, galaxy 
galaxy, quasar 
QSO_FIRST_BOSS, galaxy, 


QSO_KNOWN-_MIDZ=CORE=BONUS 


QSO.KNOWN.MIDZ, CORE, 
QSO. FIRST. BOSS, galaxy, BONUS 


? Target types are listed in order of fiber collision priority from highest to lowest. “quasar” 
and “galaxy” refer to all target bits within those target classifications; i.e., all quasar or 
galaxy target bits have equal priority for resolving fiber collisions. A *—" symbol between 
two target bits means those target bits have equal collision priority. The priorities in the 
last row are defined for plates designed and drilled through the end of DR9 and will likely 


apply through the end of the BOSS survey. 


troscopic observations is the same as that used during the 
SDSS-I and -II surveys but was never documented. The 
process is fundamental to the reconstruction of the angu- 
lar selection function for studies of clustering in galaxies 
and quasars and is described below. 


3.1. Fiber Assignment for Science Targets 


Fibers are first allocated to science targets using the 
tiling algorithm described in (Blanton et al] (2003). 'The 
tiling process is intended to maximize the fraction of tar- 
gets that are assigned fibers (tiling completeness) while 
minimizing the number of tiles required to complete ob- 
servations (tiling efficiency) without leaving gaps of cov- 
erage in the survey footprint. The process for BOSS is 
nearly identical to that for SDSS-I and -II, except for 
the number of fibers and prioritization of targets. We 
describe the process for BOSS here. 

We begin with a uniform distribution of tiles, which are 
then perturbed to account for angular variations in the 
density of targets. Of the 1000 available fibers on each 
tile, a maximum of 900 fibers are allocated for science tar- 
gets; the rest are reserved for calibration stars and mea- 
surements of the sky background (i.e., fibers placed at 
locations without detectable objects in the SDSS imag- 
ing data). Of the science fibers, five on each plate are 
assigned to targets that are also assigned fibers on neigh- 
boring tiles. We refer to these as “repeat observations" 
and use them to test the reproducibility of spectroscopic 
results. The remaining fibers are reserved for unique 
quasar, galaxy, or ancillary targets. Most plates do not 
use all available fibers because the plates must overlap to 
avoid gaps in coverage, thereby increasing the total num- 
ber of plates and number of fibers beyond the minimum 
required by the number of science targets. The overall 
tiling efficiency of the survey, defined as the fraction of 
these 895 fibers assigned to unique, tiled science targets, 
is 0.927 with a plate-to-plate dispersion of 0.067. The re- 
maining fibers are used for lower priority targets such as 
additional repeat observations, QSO. VAR. SDSS tar- 
gets, and additional sky fibers. 

Groups of targets that are linked together within the 
62" fiber collision limit are denoted collision groups. The 
priorities for assigning fibers within a collision group vary 
with target type, and have varied throughout the sur- 
vey as shown in Table[2] The decollided set of targets 
is the subset that does not lie within 62" of any other 
target. Targets may also be precluded from fiber assign- 
ment because they lie within a 92" radius of the center 


of each tile, which is the region covered by the center- 
post of the cartridge. Given these conditions, the tiling 
algorithm attempts to maximize the number of targets 
assigned fibers. In regions covered by a single tile, fiber 
collisions limit the galaxy and quasar tiling completeness 
to ~ 90%. Once the galaxy and quasar target sets have 
been assigned fibers, ancillary targets are assigned to the 
remaining fibers. In this way, galaxy and quasar targets 
are never competing for fibers with the lower-priority an- 
cillary set. Each point of the sky may be covered by up 
to four tiles (although overlap by more than two tiles is 
fairly rare). In tile overlap regions, targets within col- 
lision groups may be assigned to multiple tiles, bring- 
ing the completeness near unity in these regions (Ta- 
ble B). The tiles and the imaging footprint define the 
geometry of the survey. We use the software package 
mangle to create and manipulate 
the survey geometry, as described briefly a eal 
(20114). 

The fiber assignment in BOSS was not performed as 
a single process over the full spectroscopic footprint; in- 
stead, small regions of the footprint were tiled at different 
times to accommodate changes in the target selection al- 
gorithms. Each of these regions is called a *chunk" and 
is denoted *bossN", where N is the chunk number. These 
chunks vary in size from a few dozen to a few hundred 
plates, depending on the region and the status of the tar- 
get selection testing at the time the plates were needed. 
Within each chunk, a sector is defined as a region covered 
by a unique set of tiles. 

While the fiber assignments changed within tiles, the 
tile centers were set around December, 2010 in boss15 
and will not change. The final tiling of the BOSS foot- 
print results in a total of 2208 plates as shown in Figure[I] 
and Table[1] 1575 plates lie in the NGC and 633 plates lie 
in the SGC. After accounting for spatial patterns in tiling 
and incomplete regions, the tiled area of the 10,269 deg? 
imaging footprint (12.1) amounts to a total of 10,060 
deg?. The mean area covered per tile is 4.7 deg? in the 
NGC and 4.2 deg? in the SGC. The higher tile density in 
the SGC is required to include approximately 20 deg? 
additional galaxies from the LOWZ sample that lie in 
regions not covered in previous SDSS spectroscopy. The 
area of the survey covered by single tiles is 5417 deg?, 
while the remaining 4643 deg? is covered by two or more 
tiles. The statistics of completeness for each target type 
are found in Table[3] Because of the connection between 
completeness and tile overlap, and the varying efficiency 


TABLE 3 
COMPLETENESS BY TARGET TYPE AND SECTOR TYPE 
Target Type Single-tile Sectors Overlapping Sectors 
Mean Dispersion? Mean Dispersion? 


Galaxy 0.891 0.010 0.988 0.001 
Quasar 0.914 0.030 0.998 0.002 
Ancillary 0.680 0.143 0.850 0.158 


a RMS computed from area-weighted chunk-to-chunk varia- 
tion in boss1 through boss20. 


among the chunks, the incompleteness has strong spatial 
patterns that must be included in clustering analyses. As 


examples, describe the method to 
account for incompleteness in galaxy clustering due to 
fiber collisions and [White et all (201 describe the pro- 
cess in a measurement of quasar clustering. 


3.2. Plate Design 


In the tiling stage described above, approximately 160- 
200 fibers per plate are dedicated to the main quasar 
targets, 560-630 to galaxy targets, and 20-90 to ancillary 
science targets. In plate design, additional fibers are as- 
signed to each plate for the purpose of sky subtraction 
and calibration. 

We assign each plate at least 80 "sky" fibers placed 
at locations that contain no detections from the SDSS 
imaging survey. A random selection of such locations 


is output from the photometric pipeline, as described in 
(2002). The sky fibers are used to model 
the spectroscopic foreground for all science fibers. Their 
distribution is constrained to cover the entire focal plane, 
by requiring at least one sky fiber per bundle of 20 fibers. 
The fibers within a bundle are anchored to a fixed point 
on a cartridge, easing the handling of fibers but limiting 
the reach of a bundle. The spacing of sky fibers allows 
sampling of the varying sky background over the focal 
plane and over the optical system of the spectrograph, 
allowing a model to be constructed for any spatial struc- 
ture and leading to improved sky subtraction. 

We assign each plate 20 “standard” fibers to sources 
with photometric properties consistent with main se- 
quence F stars, to serve as spectrophotometric standards. 
These stars allow calibration of throughput as a func- 
tion of wavelength for each exposure, including atmo- 
spheric telluric absorption corrections and the spectral 
response of the instrument. We choose these stars to 
have 15 < rgyo < 19 and Maist < 0.08, where Maist is a 
scaled distance in color space from the color of a fiducial 
F star: 


mais = |((u g) — 0.82)? + ((g — r) — 0.30? 


+ ((r — i) — 0.09)? + ((i — z) — 0.02)? "6 (4) 


We use PSF magnitudes corrected for Galactic ex- 
tinction to calculate this quantity. Typically there are 
at least 10 deg"? standard stars (or 70 per plate), al- 
though their density on the sky is a function of Galactic 
coordinates. These relatively hot stars provide a well- 
understood and mostly smooth continuum that allow the 
spectra to be calibrated. A stellar template of appropri- 
ate temperature and surface gravity is derived for each 
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star and used to derive the spectral response as described 
in [Schlegel et al] (2012). 

During plate design, we choose 16 “guide” stars for 
each plate. During observations, each hole for a guide 
star is plugged with a coherent fiber bundle that is con- 
structed with alignment pins to track orientation. T'wo 
guide stars near the center of the plate have large bun- 
dles (30" diameter) and are used to acquire the field. 
The remaining fourteen guide stars use smaller bun- 
dles (10" diameter) and are used to guide during ex- 
posures. We choose guide stars from the SDSS imaging 
with 13 < gpgr < 16.5 and colors 0.3 < g — r < 14, 
0 <r-—i < 0.7 and —0.4 < i —z < 1 determined 
from PSF magnitudes. The positions are corrected for 
proper motion when data from the US Naval Observa- 
tory catalog (USNO-B; are available. 
We use the DR8 version of the SDSS astrometry both 
for the guide stars and the science targets. Because of 
errors in that astrometry, there are some offsets at de- 
clinations 6 > +41°. However, as [Aihara et al] 
explains, these offsets are coherent on large scales, and 
the expected contribution of these errors on three degree 
scales (corresponding to the plate size) is 60 mas per star, 
well below our tolerances. Finally, we drill holes (3.175 
mm, corresponding to 52.5" diameter) at the locations 
of bright stars to minimize light scattered from the sur- 
face of the plug plate. Stars brighter than my = 7.5 
are chosen from the Tycho-2 catalog (Hog et all 2000). 
These light trap holes are not plugged with fibers (they 
are larger size to prevent accidental plugging) and allow 
light to pass through the focal plane unobstructed. 

The fibers assigned to standard stars, guide stars, and 
sky fibers are distributed uniformly over each plate to 
ensure consistent data quality for all spectra, regardless 
of their position in the focal plane. Because the fibers in 
the instrument are distributed in 50 bundles of 20 fibers 
each, we assign each hole to a particular bundle for ease 
of plugging. Unlike the cartridges used in SDSS-I and 
-II, half of the BOSS fibers are colored red and half blue; 
the red and blue fibers alternate in position along the 
slit head. By tracking the holes associated with each tar- 
get, we assign red fibers preferentially to CMASS targets, 
which are typically very faint at short wavelengths. By 
doing so, we minimize cross-talk on neighboring fibers in- 
side the spectrograph between quasars and other quasars, 
low-redshift galaxies, or ancillary targets that have sub- 
stantial blue light (since they are usually separated by 
at least one fiber on the CCD). Most bundles do not 
have exactly ten high-redshift galaxies for the ten red 
fibers, so the extra high-redshift galaxies are assigned 
to blue fibers, or the extra red fibers are assigned to 
other objects. Other than the bundle assignment and the 
red/blue fiber designation, there is no requirement that 
particular fibers be placed in particular holes. As de- 
scribed in §4.1]the exact hole where each fiber is plugged 
is determined shortly before observations. 

In planning the survey, we estimate the Local Sidereal 
Time (LST) at which each plate will be observed. We use 
the corresponding hour angle and altitude predictions of 
the field during observation and determine each hole po- 
sition on the plate accounting for atmospheric differential 
refraction (ADR). For galaxies, standard stars, or ancil- 
lary targets, we center the hole position at the optimal 
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focus and (x,y) position for 5400 A light. Since ADR de- 
pends on wavelength, redder or bluer light will be offset 
in (z,y) coordinates away from the center of the fiber. 
For quasars, we center the hole position on the 4000 À 
light to maximize the SNR in the Lya forest. This differ- 
ence corresponds to about 0.5" typically, and the overall 
throughput difference at wavelengths shorter than 4000 
A can be about 50%. In the DR9 data model (plat- 
eDesign and spAll files), the quantity LAMBDA. EFF 
records the wavelength for which the hole position of each 
object was optimized. 

In addition to the wavelength-dependent ADR. offset, 
we also account for the wavelength dependence of the 
focal plane when observing the quasar targets. The fo- 
cal plane for 4000 A light differs in the z-direction from 
the focal plane for 5400 À light by 0-300 microns, de- 
pending on the distance from the center of the plate. To 
account for this difference, small, sticky washers are ad- 
hered to the back of the plate for quasar targets, where 
the fibers are plugged. The washer causes the fiber tip 
to sit slightly behind the 5400 À focus. No washers are 
used for holes within 1.02? of the plate center. Between 
1.02? and 1.34°, 175um washers are used, and between 
1.34? and 1.49°, 3004m washers are used. These washers 
only became available after Modified Julian Date (MJD) 
55441 (September 2, 2010), and were not consistently 
used until MJD 55474 (October 5, 2010). In the DR9 
data model, the quantity ZOFFSET (plateDesign and 
spAll files) records the intended usage of washers, but not 
the actual usage. The exact washer usage for each obser- 
vation during this transition period (including plates ob- 
served both before and after) is documented on the pub- 
licly available software webpagd"7. 'The discrepancy will 
be resolved with DR10 in the summer of 2013. By op- 
timizing the focal plane position, and thus the SNR, for 
4000 À light, we are also perturbing the spectrophotom- 
etry relative to the standard stars as discussed in 46.1| 
Only the main quasar targets are optimized for 4000 À 
focal plane and ADR offsets in DR9. QSO. VAR.SDSS 
targets, and a few other programs assigned the ancillary 
target flags, will be similarly affected in the future. Oth- 
erwise, the plate design remains the same as it was in the 
SDSS-I and -II surveys (Stoughton et a1][2002]). 

During exposures, the guider adjusts the offsets and 
plate scale according to changes in ADR, and adjusts ro- 
tation according to changes in altitude and azimuth. In 
addition, thermal expansion of the plate due to temper- 
ature changes and stellar aberration create purely radial 
shifts in the position of objects, these effects are corrected 
with changes to plate scale as predicted by the guider by 
adjusting the primary mirror axially and refocusing the 
secondary mirror. At the design hour angle, all guide 
star and science target images will be centered in each 
fiber. However, because observations typically begin be- 
fore the design hour angle and complete after the design 
hour angle, the image of each object will drift across the 
center of each fiber, and there will be no adjustment the 
guider can make to center all of the guide stars. To com- 
pensate, we apply changes to the plate scale to minimize 
the effect. The differential change in position of the im- 
age centers across the plate constrains the hour angles 


77 www.sdss3.org/svn/repo/idlspec2d/trunk/opfiles/washers.par 


over which the plate is observable. We define the plate 
observability window such that the maximum offset of 
any hole relative to its image (in perfect guiding at the 
design wavelength, 5400 À or 4000 À) is less than 0.3". 
'The typical visibility window lasts more than two hours. 
'This window is longest for plates designed to be observed 
at transit, and it gets progressively longer at higher dec- 
lination, where the rate of change in airmass with time 
is slower. 


4. SPECTROSCOPIC OBSERVATIONS AND DATA 
REDUCTION 


'The plates are designed as explained in the previous 
section and machined at the University of Washington 
months in advance of the observations. The plates are 
prepared by the staff at APO before observations begin. 
Experience from the earlier surveys motivated the pro- 
cedures below to produce a survey of uniform coverage 
and data quality. The process of plate drilling, observ- 
ing, and data processing is nearly identical to SDSS but 
described for the first time here. 


4.1. Plate Drilling and Preparation 


Plates are drilled at a machine shop operated by the 
University of Washington, where up to eight plates can 
be drilled in one day. The plug-plates are an aluminum 
alloy, 3.2 mm thick, 0.813 m in diameter and weigh 4.3 
kg. Because the telescope focal plane is not perfectly flat, 
the plates are deformed during drilling to align the hole 
axes with the optical axes. When the plate is observed, 
it is similarly deformed by the cartridge to match the 
best-focus surface. Typical drilling position errors are 
« 0.15" RMS, although during observations the exact 
angle at which the fiber rests in the hole can contribute 
larger errors in the focal plane position. 

Plates are shipped to APO where they are received, un- 
packed, and “marked”. In the marking stage, the original 
plate design is projected onto the drilled aluminum plate. 
Using felt-tipped markers, the staff at APO trace the 
groupings of 20 fibers in a bundle from the plate design 
onto the aluminum plate to ensure that all 1000 bundled 
fibers can reach all 1000 holes. Following the projected 
plate design, the APO staff install washers that are man- 
ufactured with an adhesive on one side around the holes 
of quasar fibers that need to be offset from the focal 
plane. They also mark the locations of holes for guide 
star fibers. The entire process takes around 30 minutes; 
an image of a marked plate is shown in Figure In 
SDSS-I and -II, plates were not observed during bright 
time and the staff at APO had more time to mark plates. 
In SDSS-III, the staff at APO are typically occupied with 
Marvels and APOGEE observations in bright time. We 
therefore typically try to complete preparation of a BOSS 
plate at least one month before it is observed to ease the 
scheduling of marking and observations. However, early 
commissioning plates and changes in target selection of- 
ten forced last-minute shipment and plate preparation 
during the first year. 

Once a plate is marked, a database is updated to in- 
dicate that it is available for observation. When a plate 
is chosen for observation on a given night based on its 
visibility window, a message is sent from the nighttime 
staff to the daytime staff requesting that the fibers on 
this plate be “plugged”. In the plugging stage, two staff 


Fic. 2.— Image of plate 3552 immediately after the marking 
stage. Bundles are separated by black bounded edges, and holes 
are marked blue to reduce contamination between nearby emission 
line galaxies or quasars. Holes for guide star fibers are marked in 
black and denoted by the corresponding number ranging from 1-16. 


members install the plate onto the cartridge and plug 
fibers from each bundle into the appropriate region of 
marked holes. When the plugging is complete, they mea- 
sure the profile of the plate surface to confirm that the 
cartridge has deformed the plate to properly match the 
surface of the telescope focal plane. Finally, a custom- 
designed machine maps the location of the fibers in the 
focal plane by illuminating the output of each fiber with 
a light and recording the position of the corresponding 
hole with a camera that is focused on the surface of the 
plug plate. Repeat tests have proven the mapping pro- 
cess to be extremely reliable. Fibers are occasionally 
broken, and after plugging occasionally a fiber slips out 
of its hole enough to not receive light. Such cases occur 
~ 2 times per plate, are detected during mapping and 
recorded in the spectroscopic reductions. The process of 
plugging and mapping typically takes about 45 minutes. 
A full description of the marking and mapping is found 
in (Smee et all (2012). Up to nine BOSS cartridges can 
be plugged at any time. 


4.2. Procedures During Observations 


BOSS observations are performed by a rotating staff 
of two or three nighttime staff members known as ob- 
servers. Preparations for observing begin in the after- 
noon; one observer checks the status of all instruments, 
including CCD temperatures and controller connections, 
followed by a bias exposure to confirm that all four de- 
tectors (two for each spectrograph) are functioning cor- 
rectly. On nights when BOSS observations are planned, 
the afternoon observer focuses the spectrographs using a 
pair of Neon-Mercury-Cadmium arc exposures behind a 
Hartmann mask. The Hartmann mask successively ob- 
structs the top half and the bottom half of the colli- 
mated beam. A simple analysis of the cross-correlation 
between the two arc exposures reveals the optical cor- 
rection required to obtain optimal spectrograph focus. 
The required correction to the total path length can be 
applied with an adjustment to the position of each colli- 
mator mirror, thereby changing the focus on the red and 
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blue cameras by an identical amount. The afternoon 
adjustment confirms the performance of the instrument; 
the collimator mirrors are adjusted throughout the night 
each time observations begin for a new plate. In cases 
where there is a differential in focus between the two 
cameras, the correction is applied through adjustment 
of the focus ring on the blue camera, changing the path 
length to the blue detector with no impact on the focus 
on the red side of the spectrograph. Because the blue 
focus rings must be moved manually, they are typically 
only adjusted during the afternoon. Using typical nightly 
temperature changes and associated focus drift, the blue 
focus rings are adjusted every afternoon to compensate 
for the decrease in temperature and minimize the effect 
of differential focus over the course of the night. Cur- 
rently, the predictive adjustment is —4° on the B1 focus 
ring and +8° on the B2 focus ring. Following the focus 
routine, a five second arc and a 30 second flat field ex- 
posure are obtained to measure the spectral profile and 
confirm the focus of the instrument. 

When the sun is ~ 12° below the horizon in the 
evening, the observers fill the liquid nitrogen recepta- 
cles in all four cameras and open the telescope enclosure. 
The cartridge with the first scheduled plate of the night is 
mounted to the telescope, signaling the start of observa- 
tions. A script is run to automatically slew the telescope 
and perform calibration routines in the following order: 


e Close the eight flat-field petals 
that obstruct most of the opening in the wind baffle 
and provide a reflective surface for the calibration 
lamps. 


e Turn on Neon-Mercury-Cadmium arc lamps; they 
require several minutes to warm up. 


e Slew telescope to field. 


e Take a pair of exposures through the Hartmann 
masks using a shorter readout of the detector on 
a selected sub-region centered on well-known arc 
emission lines. 


e Adjust collimator positions to account for change 
in focus since afternoon checkout and cartridge 
change. Because focus rings are not adjusted dur- 
ing the night, the median between the best blue 
and red focus is used to adjust the collimator mir- 
rors accordingly. 


e Acquire five second calibration arcs and 30 second 
calibration flats with a quartz lamp. 


e Open petals and take the first guider exposure 
through coherent fiber bundles. 


When the first guider exposure is complete, the ob- 
servers center the telescope on the field, using the ob- 
served guide stars. Once the guide stars are roughly cen- 
tered in their respective fiber bundles, closed loop guiding 
begins and the plate scale of the telescope, the rotation of 
the telescope relative to the field, offsets in altitude and 
azimuth, and focus are adjusted. At this point, the light 
from the targeted objects is optimally focused on the 
fibers and the observers begin a sequence of 15 minute 
science exposures. When observations of the plate are 
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completed according to the criteria described below, the 
telescope is moved back to the zenith, the cartridge re- 
placed with the next scheduled plugged cartridge, and 
the series of field acquisition, calibration, and exposures 
is repeated. The entire process takes about 15 minutes 
between cartridges. In the first year of the survey, ap- 
proximately 6596 of the available time for observing was 
spent with the shutter open on science targets. In the sec- 
ond year, we automated the scripting of field acquisition 
as described above and improved to 75-7896 efficiency, 
depending on the amount of exposure time required for 
each plate. 

Ideally, we would acquire signal on each plate until the 
full data reductions reveal a data quality that exceeds 
the threshold required to meet the survey requirements. 
However, the full reductions take much longer than the 
actual observations, so we instead use a suite of simple 
reductions performed in real time to provide quick feed- 
back to the observers. We refer to these data reductions 
as the “Son of Spectro” (SOS) reductions, referring to 
the full spectroscopic reduction pipeline. At all times, 
a daemon automatically identifies new images as they 
appear on disk and submits them for flat field calibra- 
tion, wavelength solution, and the extraction of the one- 
dimensional spectra for each fiber. The SOS reductions 
perform the same functions as the spectral extraction 
in the full data reduction pipeline (Schlegel et al]D013), 
but they use simpler sky subtraction, use box-car extrac- 
tion to create one dimensional spectra, and perform no 
redshift or object classification. Comparing these quick 
reductions to the full reductions on the data, we find a 
reduction of approximately 2096 in the SOS SNR for R1 
and R2 and 1096 in SNR for B1 and B2. 

Approximately five minutes are required to process a 
single flat field exposure and one minute is required for 
arc exposures or science exposures. The results are con- 
veyed to the observers on a webpage that automatically 
updates every minute. The webpage provides simple 
metrics for wavelength coverage, spectral resolution, pro- 
file of each fiber on the CCD, and, most importantly, the 
median SNR per pixel over a synthetic i—band bandpass 
filter for the red cameras and over a synthetic g—band 
bandpass filter for the blue cameras. Statistics of this 
SNR as a function of magnitude allow diagnostics of 
problems in the data such as focus or guiding errors. The 
magnitudes are estimated from the SDSS imaging data 
and corrected for Galactic extinction using the maps of 
dust infrared emission from [Schlegel et al] (1998), scaled 
to optical wavelengths using the model of|Cardelli et al. 
An example of the SNR portion of the report 
generated for a composite of four 15-minute exposures 
on a single plate is shown in Figure [3] 

The observers use the SOS reports to diagnose perfor- 
mance such as the quality of focus, the fraction of fibers 
that dropped from the plug plates (typically fewer than 
two or three on a plate), and the rate at which signal is 
acquired. They use the accumulated SNR evaluated at 
a fiducial magnitude in the sky-dominated regime to de- 
termine when a plate is complete. As shown in the left 
hand panels of Figure [3] a power law representing the 
SNR as a function of magnitude is fit to determine SNR 
at the fiducial values of ggp2 = 22 and igng = 21. The 
SNR? scales linearly with exposure time and is therefore 
easier to use when estimating remaining exposure times. 


As explained in $5] the SNR? of the exposures at these 
magnitudes is chosen to balance high redshift complete- 
ness for the galaxy sample with the goal of observing 
10,000 deg? in five years. The observers remove the car- 
tridge when the plate is complete by the SNR criteria 
and request a new plate to be plugged in that cartridge 
the following day. Because of limited visibility windows, 
it is not uncommon that observations must cease before 
a plate is complete. In these cases, the plate remains in 
the cartridge and is observed the next clear night with 
a new set of calibration frames. The exposures taken on 
all nights are considered for determining when a plate is 
complete. 


4.3. BOSS Data Reduction Pipeline 


At the end of each night of observing, the data are sent 
to LBNL to be processed. An automated software rou- 
tine checks the data transfer status every 15 minutes and 
begins the processing when all of the data have arrived. 
The data processing jobs are organized by plate such that 
each job extracts, calibrates, coadds, classifies, and fits 
the redshift of all 1000 spectra using all exposures of a 
single plate, including exposures taken on different nights 
with the same plugging of fibers. The data are reduced 
first by collapsing them from the two-dimensional image 
into one-dimensional spectra (Schlegel et al] [2012]. In 
the second step, the one-dimensional spectra are classi- 
fied into object types and redshift (Bolton et al][2012). 
The software is written primarily in Interactive Data 
Language (IDL]^7] and is collectively referred to as “idl- 
spec2d”. This processing typically begins mid-afternoon, 
and the results are available to the collaboration by the 
following morning. 

Raw CCD frames are pre-processed by subtracting a 
bias model, their bias overscan, subtracting a dark cur- 
rent model, and dividing by a pixel flat-field model for 
each CCD. The models for bias and dark current are de- 
rived from calibration images taken periodically through- 
out the survey. The pixel flat-field images are com- 
puted about once every six months using a specially de- 
signed slithead that illuminates the entire detector with- 
out fiber-to-fiber spatial structure. The read noise is 
measured from the bias overscan region for each CCD 
amplifier for each exposure. Per-pixel variance is esti- 
mated using the measured read noise and the observed 
photon counts in each pixel. The inverse variance is mul- 
tiplied by a known CCD defect mask, and cosmic rays 
are identified to mask affected pixels. 

The spectra are extracted from the two-dimensional 
CCD frames into a set of one-dimensional spectra. The 
quartz lamp flat-field spectral images establish the cross- 
dispersion profile and initial location of the spectral 
traces as projected onto the CCDs. The locations are 
then shifted to match the science frame spectra to ac- 
count for instrument flexure from the time of the cali- 
bration exposures to the science exposures. These offsets 
are typically 0.1 A, but can be as large as 1.5 A. 

The spectra are extracted in groups of 20 according to 
their respective fiber bundles. We use a noise-weighted 
optimal extraction algorithm with the 
Gaussian profile cross dispersion widths measured in the 
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FiG. 3.— An example of the diagnostic from a series of four science exposures of plate 3775 that are produced in the quick reductions 


of data at APO. Similar plots become available to the observers less than five minutes after the end of an exposure. Left: Cumulative 
signal-to-noise ratio (S/N in the figure captions) as a function of ggp2 (top) and igpa (bottom). Fibers from the two spectrographs are 
indicated by the blue symbol “x” (spectrograph 1) and a magenta square (spectrograph 2). The intercept of the linear fit of log(S/N) as 
a function of magnitude and RMS of that fit are computed for each spectrograph separately. Only fibers with fiber2 magnitudes in the 
range 21-22 (20-21) for the blue (red) cameras are used in the fit as indicated by the vertical dotted lines. The slope is held fixed at —0.3 
as empirically determined from fits to the larger sample. This magnitude range is near the region of sky-limited noise and near the faint 
end of the main galaxy and quasar samples for the red and blue cameras respectively. Right: The spatial profile of SNR over the plate. 
Red symbols represent fibers that fall below the best fit of the SNR linear solution that is represented by the solid line in the left hand 
panels. Green symbols represent fibers that fall above the best linear fit of the SNR solution. The size of the symbol relates to the amount 
by which the fiber deviates, growing larger for fibers with larger deviation from the best fit. Guiding problems or other systematics can 
appear as coherent structure in this diagram. The distribution of objects that fall below the best-fit line is fairly uniform, indicating a lack 


of such effects for this particular plate. 
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flat field spectra. A linear cross-dispersion background 
term is included for each bundle to account for scattered 
light not described by the two-dimensional Gaussian pro- 
file. The centroids and widths of the Gaussians are fixed, 
and the extraction solves only for the amplitudes at each 
wavelength and the linear background coefficients. 

'The wavelength solution is initially estimated from the 
extracted arc-lamp spectra, then shifted to match the 
observed sky lines in each science exposure. Since the 
extraction is performed in the native pixel spacing of 
the CCD, this results in individual spectral bins that 
are statistically independent but not perfectly aligned in 
wavelength between spectra or between exposures. The 
extracted science spectra from individual exposures are 
then divided by the extracted flat-field spectra to correct 
for fiber-to-fiber throughput variations. Sky subtraction 
is performed using a model for background derived from 
the sky fibers that were assigned during plate design. 
'The background varies with fiber position to account for 
smoothly varying differences across the focal plane. Sim- 
ilarly, the spectral response (i.e., the flux calibration) is 
determined over the focal plane using models fit to the 
spectra from the standard stars that were assigned in 
plate design. 

Finally, the spectra from individual exposures are com- 
bined into a coadded frame for each fiber on a resampled 
grid that is linear in log(A). Data from both red and blue 
cameras is used in the coadded frame, generating spectra 
that cover the full 361 nm — 1014 nm wavelength range 
of the instrument. The estimated pixel variance is prop- 
agated into variance estimates of the extracted and co- 
added spectra, but the covariance terms between differ- 
ent spectra are discarded. The current pipeline performs 
this extraction as a single pass, resulting in a known bias 
due to estimating the variance from the data rather than 
iteratively updating the noise model with the statistics of 
the extracted spectra. In the limit of zero flux, the tech- 
nique systematically assigns a slightly larger variance to 
pixels that fluctuate toward higher flux values, leading to 
a weighted mean that is suppressed below its true value. 
This bias is not corrected in the DR9 sample, but may 
be addressed in a future data release. The process is 
performed independently for objects observed on differ- 
ent fibers or different pluggings of a plate, and coadded 
spectra are recorded for all observations. Spectra from 
objects that are observed multiple times are evaluated, 
and the observation that produces the best best avail- 
able unique set of spectra for the object is identified with 
the SPECPRIMARY flag, as described in the online 
documentation}. Further details about this extraction 
pipeline are published in [Schlegel et al] (2012). 

As described in|Aihara et al] (2011al), star, galaxy, and 
quasar templates are fit to the combined one-dimensional 
spectra to determine the classification and redshift of 
each object. Redshift classification errors for all ob- 
jects are reported by the ZWARNING bitmask key- 
word. We found that removing the quasar templates 
from the fits to the galaxy sample reduces the number 
of cases of catastrophic failures and classification confu- 
sion. Allowing only galaxy and stellar templates, we de- 
termine a second redshift for objects targeted as galaxies 
and report the redshift and classification errors with the 
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Z NOQSO and ZWARNING_NOQSO keywords, re- 
spectively. Further details about the redshift and clas- 
sification of BOSS spectra are published in [Bolton et al] 
(20120). 

Throughout the survey, continuous improvements have 
been made to the spectroscopic data processing pipeline. 
Every few months a new version is tagged and all of the 
data are re-processed starting from the raw data. These 
tagged versions are released to the collaboration for in- 
ternal use and the daily processing continues with the 
latest internally released tag. The majority of process- 
ing time is spent on redshift and object classification due 
to the large range of redshift over which templates must 
be fit. The total processing takes 8 — 12 hours per plate, 
although batch jobs allow an entire data set to be pro- 
cessed in a few days. Version v54_45 of the idlspec2d 
pipeline code is used to report the results from DR9. 

Data reduction results are mirrored to New York Uni- 
versity (NYU) on a daily basis to serve as an offsite 
backup. In addition, the daily reductions are processed 
in parallel on a Linux cluster at the University of Utah 
(UU), thus ensuring continuously tested alternate pro- 
cessing capability. The standard data flow is from APO 
to LBNL to NYU and UU; approximately once a year 
alternate paths are tested (e.g., APO to NYU to LBNL 
and UU). Disaster recovery plans with a 24 — 48 hour 
turnaround have been developed and tested, with scenar- 
ios ranging from the outage of a single disk to permanent 
loss of the LBNL computing center. 

DR9 includes all of the spectral products available to 
the BOSS collaboration: coadded and individual expo- 
sure flux calibrated spectra, inverse variances per pixel, 
masks, subtracted sky, calibration vectors, and model 
fits. Intermediate data products such as uncalibrated 
spectra and extracted arc/flat lamp spectra are also 
available. Catalog data such as redshifts, classifications, 
astrometry, and quality flags are available via the SDSS- 
III Catalog Archive Server database and in a fits file bi- 
nary table (spAll-v5_4_45.fits). The spectra themselves 
are available in a format of one plate-mjd-fiber per file, 
or in a bundled format of all spectra of all objects for 


each plate-mjd. Details for accessing these data are at 
and Aim et al] (2012. 
5. PROJECTIONS FOR COMPLETING THE SURVEY 

BOSS has been allocated the majority of the dark 
and gray time in the SDSS-III survey since Fall 2009. 
Bright time and a fraction of gray time are used for high 
resolution spectroscopy of Milky Way stars to search 
for extrasolar planets (MARVELS; and 
to measure abundances in evolved, late-type stars from 
H —band spectroscopy to study evolution of the Milky 
Way galaxy (APOGEE; Majewski et al., in prep). These 
programs are less sensitive to the increased sky back- 
ground. On nights when the moon is illuminated at less 
than 39%, all of the observing time is allocated to BOSS. 
On nights when the moon is illuminated at more than 
5696, time is given to BOSS when the moon is below 
the horizon. For nights when the moon is between 3996 
and 5696 illumination, the allocation of time depends on 
the time of year. In these phases, BOSS is only given 
time when the moon is below the horizon when the NGC 
cannot be observed; BOSS is given the full nights when 
the NGC can be observed. Time is only split between 
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Fic. 4.— Left: Distribution of plates in the BOSS footprint binned in 15 minute increments in Right Ascension. Right: Number of 
hours of observing time available as a function of LST (black line) and the simulated time required at each LST to observe the full survey 
(blue line). The observing time is sampled in 15 minute increments and assumes a uniformly distributed 45% efficiency after weather loss. 
The discrete sum of the entries under the black line is equal to 3585 hours. The final simulated LST distribution shown in blue is discussed 


in $5.5] 


projects on a given night if there are at least 1.4 hours 
allocated to each survey. 

Based on historical records, we assume that 5596 of 
the time will be lost to bad weather and that most of 
the months of July and August will be lost to summer 
monsoon. Taking the total amount of time allocated to 
BOSS and the expected weather loss, we anticipate ap- 
proximately 3600 hours of observation over the course of 
the survey from December 2009 through July 2014. The 
distribution of this time as a function of LST is shown 
in Figure 4] To maximize the survey volume to probe 
BAO with galaxies and Lya quasars, our goal is to ob- 
serve the entire 10,060 deg? footprint tiled with targets 
from the SDSS imaging program. Given the total num- 
ber of spectroscopic plates over this area, and assuming 
perfect scheduling, the average target can be observed for 
1.62 hours including overhead due to field acquisition and 
calibration exposures. It is essential to characterize the 
on-sky performance of BOSS to maximize the efficiency 
of these integrations over the full five-year survey. 

A comparison of the left- and right-hand panels of Fig- 
ure [4] quickly reveals that the distribution of plates in 
Right Ascension is not perfectly matched to the time 
available to BOSS over the course of the survey. The 
differences are particularly large in the intervals 3 hr « 
LST « 7 hr and 17.5 hr « LST « 21 hr when the Galac- 
tic plane passes directly overhead. We must then observe 
the high Galactic latitude BOSS plates at these times at 
non-zero hour angles. Hour angles are chosen to optimize 
LST coverage while minimizing the additional observa- 
tion time required to account for the higher airmass. 

Observations are planned at a high target density to 
measure the BAO signal, obtain high redshift complete- 
ness, and finish the survey (Figure[I] in the allotted time. 
We quantify redshift completeness in the context of the 
value of SNR? reported by the SOS reductions and find 
that exposures must be at least as deep as SNR? > 20 
on the red cameras and SNR? > 10 on the blue cameras 
to obtain the desired completeness in galaxy redshifts. 
We estimate the maximum exposure time that allows 
completion of the full BOSS footprint and find that we 
have time to allow for slightly deeper exposures than that 


minimum requirement. In this section, we describe the 
tension between these two constraints and our analysis to 
determine the optimal hour angles and SNR? thresholds. 


5.1. Galaxy Redshift Completeness 


Meeting the projections for BAO constraints from the 
galaxy sample described in 42.3] requires that we mea- 
sure galaxy redshifts for >94% of the targets, where 
the remainder are either not galaxies or are recognized 
redshift-fitting failures. For comparison, the SDSS main 
and LRG spectroscopic samples had a redshift success 
rate of >99% for galaxies at 0 < z < 0.45 
[2002]. We first identify a SNR? threshold at the fiducial 
magnitude for determining the completion of a plate that 
realizes this 94% redshift success rate for galaxies. 

In the first year of BOSS operations, we lacked the in- 
formation to empirically determine the quality of a typi- 
cal exposure or the depth of the data required to obtain 
redshifts for faint targets. We thus intentionally chose 
a threshold that produced deeper data than would al- 
low us to finish the survey in the five year window, with 
the idea of adjusting the thresholds once redshift suc- 
cess rates could be quantified. Specifically, we chose a 
threshold of SNR? > 16 in the blue cameras evaluated 
at gab2 = 22, and SNR? > 26 for the red cameras eval- 
uated at igpg = 21. For nights when the plate visibility 
window expired before meeting these thresholds, a plate 
was considered complete if it reached SNR? > 13 and 
SNR? > 22 on the blue and red cameras, respectively. 
Plates that did not reach these thresholds were kept in 
their cartridges and observed on the next clear night. 

To quantify the minimum SNR? to reach the survey 
goals, observations from the first year were artificially 
degraded by removing 3396 of the 15 minute exposures 
on each plate. Exposure depths for each plate were de- 
termined from the remaining subset of exposures using 
the SOS reductions. The exposures were combined as ex- 
plained in $4.3| and evaluated for redshift completeness. 
A successful classification of a LOWZ or CMASS galaxy 
target is one that produces ZWARNING.NOQSO 
= 0. A reduction in redshift completeness becomes evi- 
dent at SNR? < 20 at the fiducial magnitude on the red 
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TABLE 4 
SPECTROSCOPIC CLASSIFICATION IN SUBSETS OF FIRST YEAR DATA 


Data Subsample 


LOWZ 

CMASS 

CMASS (isb2 < 21.5) 
CMASS (21.5 < ifb2 < 21.7) 
CMASS (igno € 21.7) 


cameras and SNR? < 10 on the blue cameras. The red- 
shift completeness of the LOWZ sample and the bright- 
est objects in the CMASS sample are only marginally 
impacted by the reduced SNR?, but the faintest objects 
in the CMASS sample (ig; > 21.5) are quite sensitive 
to the SNR?. 

The results imply that each exposure must satisfy 
SNR? > 20 for the red cameras and SNR? > 10 for the 
blue cameras to obtain a reliable reliable classification of 
the CMASS objects. These SNR? values define the ab- 
solute minimum threshold that can be used, beginning 
in the second year when the observing procedures were 
updated. Table [4| shows the impacts of applying these 
thresholds on the redshift success rate for various subsets 
of the data. When applying this minimum SNR thresh- 
old, we find that 94% of the CMASS sample is classified 
with ZWARNING.NOQSO- 0. However, less than 
65% of the subset of CMASS targets with ifp? > 21.7 
are classified successfully and only 8596 of the subset 
of CMASS targets with 21.5 < igpe < 21.7 are classi- 
fied successfully. The lower efficiency for targets with 
igpg > 21.5 led to the decision to remove these targets 
from the galaxy target selection after the first year, effec- 
tively reducing the size of the CMASS sample by 5.2%. 
'The remaining CMASS targets should be classified at 
an efficiency greater than 9696 as long as all plates are 
observed to a depth SNR? > 20 and SNR? > 10. 

These completeness estimates are based on the cur- 
rent spectroscopic data reduction, which does a good 
but not perfect job of spectral extraction and classi- 
fication. We are continuing an effort to implement 
the “spectro-perfectionism” spectral extraction scheme 
and to develop new templates 
for classifying galaxies and quasars. We anticipate that 
the redshift completeness will ultimately exceed the per- 
formance in DR9. Under good observing conditions at 
low airmass and low Galactic extinction, the SNR? > 20 
condition is more demanding than the SNR? > 10 con- 
dition. However, if the blue criterion is more stringent, 
then some plates at high airmass or low Galactic latitude 
take a very long time to complete. Figure B] shows the 
distribution of SNR? and SNR? in BOSS observations 
taken in the first year. Note that the SNR? typically 
exceeds the threshold (SNR? > 26 for year one) by a 
small margin because exposures are taken in 15 minute 
intervals, even when the plate is close to threshold. Some 
plates are well above threshold because we occasionally 
had too few plates available for a given LST and therefore 
had to “overcook” the plates that we had. 


5.2. Quasar Identification and Lya Forest SNR 


While the performance of the galaxy component of 
BOSS is easily captured by the redshift efficiency, the 
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Fic. 5.— The fraction of completed plates as a function of SNR? 
as reported by SOS. The SNR? for the first year of data is shown as 
a solid line while the SNR? for the simulated survey, with somewhat 
lower thresholds as described in is shown as the broken line. 
In both cases, the SNR? for the red cameras is shown in red while 
the SNR? for blue cameras is shown in blue. 


performance of the quasar component is complicated 
by the fact that the entire Lya forest region of each 
quasar spectrum is important. While an increase in 
exposure times leads to deeper spectra in the Lya for- 
est, the extra time also reduces the total sky coverage. 
When we considered changing the exposure depths af- 
ter the first year of BOSS, scaling arguments based on 
indicated that for BAO 
studies, the gains from additional area would exceed the 
losses from lower SNR in the Lya forest. In a more thor- 
ough recent investigation, ad- 
dress the question using detailed simulated Lya forest 
catalogs constructed from Gaussian random density field 
realizations. They simulate quasars with a density 15-17 
deg~?, g < 22, exposure depths comparable to those in 
BOSS, and a redshift 2.15 < z < 3.5. Noise is added 
to each quasar spectrum assuming typical sky and read- 
out noise from BOSS. They find that the errors on the 
Lya forest correlation function increase by 10-15% if the 
exposure times are halved, and conversely, that those er- 
rors would be reduced by 30% in the limit of zero noise. 
Because the error bars on the correlation function scale 
as the inverse square root of survey area, the simulations 
confirm a decision to favor area over depth in the Lya 
spectra. We report the quality of the BOSS Lya spectra 
in 16.4] in comparison to expectations from theoretical 
predictions for an optimal quasar survey. 

Investigating the large scale structure of Lya absorp- 
tion relies critically on our ability to classify quasars and 
determine their redshifts in the first place. Quasars at 
z > 2.15 are easily identified due to strong emission lines 
such as Mg II A2798, C III A1908, C IV 41549, and Lya 


A1215. As discussed in (2012), the typical 
central pixel for a gpsr ~ 22 DR9 quasar in these emis- 
sion line regions has a SNR= 0.62, SNR= 1.4, SNR= 3.0, 
and SNR= 4.3, respectively. For comparison, the median 
SNR per pixel over the whole spectrum at gpsr ~ 22 is 
0.90. In many spectra of quasars at gesr ~ 22, only Lya 
and CIV emission lines are used in identification; about 
half of quasar targets are confirmed to be at z > 2.15. 
While incompleteness is substantial, it is important to 
emphasize that the vast majority of objects that are not 
confirmed Lya quasars are likely to be stars, which lack 
the strong emission features of typical quasars. Some of 
the failures may also be BL Lac objects or other weak- 
lined active galactic nuclei (AGN). In any event, the level 
of spectroscopic incompleteness does not seriously im- 
pact our ability to measure structure in the Lya forest. 

As an additional test of the spectroscopic complete- 
ness of the quasar sample, we performed spectroscopy 
on seven plates with quasars selected to gpsr < 22.5, 
half a magnitude fainter than the typical limit for the 
main quasar sample. These targets fall on plates 5141- 
5147 and were chosen based on variability without the 
color selection that was imposed on the other variability- 
selected targets in Stripe 82. Comparing shallow expo- 
sures of plate 5141 (the only one of these plates included 
in DR9) to deeper exposures, we found a redshift suc- 
cess rate ~ 100% for quasar targets with gpgr < 22, and 
~ 80% for 22 < g < 22.5. A manual classification re- 
vealed that no objects were falsely classified as quasars 
in the shorter exposures. While the quality of the quasar 
spectra is clearly an essential feature of the BOSS survey, 
we conclude that even shallow BOSS exposures are ade- 
quate to reliably classify quasars and that survey area is 
more important than depth for signal in the Lya forest 
region. We therefore base our metric for exposure depths 
entirely on the redshift efficiency for the galaxy sample. 


5.8. Atmospheric and Galactic Extinction 


Designing the survey, scheduling plates for plugging 
and observation, and tracking survey progress against 
completion of the full footprint require that we have ac- 
curate predictions for the total exposure time that will be 
needed for each plate to reach at least the SNR thresholds 
given in In order to determine how airmass effects 
SNR?, we first predict the impact of observing condi- 
tions on the depth of the exposures. While the depth of 
each exposure is affected by seeing conditions, changes in 
atmospheric extinction, sky brightness, and cloud cover, 
these processes are stochastic and cannot be predicted for 
a given plate at the time of observation. On the other 
hand, airmass and Galactic extinction only depend on 
the plate coordinates and design hour angle. We used 
the data from the first year of BOSS observations to un- 
derstand their effect on the mean SNR?. 

Using the reported SNR? for each exposure in the first 
year from the SOS data reductions, we found that the 
scaling of the mean SNR? with Galactic extinction is 
well described by the expected relationship: 


SNR? = constant x 1072 4x /?5 (5) 


where Ax is the predicted extinction from|Schlegel et al. 
in magnitudes in the synthetic bandpass filter 
*X". We then quantify the effect of airmass on the SNR? 
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from the SOS reductions. Higher airmass degrades the 
SNR by introducing additional sky background, increas- 
ing the seeing, and reducing atmospheric transparency. 
The exposures are binned by airmass and averaged in 
each bin to account for all other weather effects. We find 
a power law dependence on airmass (Y); the mean SNR? 
for a single exposure at the fiducial magnitudes scales for 
the red and blue cameras as: 


SNA =75x Y1 (6) 
SNR} = 3.6 x YT? (7) 


These relations accurately describe the data up to an air- 
mass of 1.3. Beyond that limit, the data become roughly 
independent of airmass with substantial scatter. 


5.4. Determining Plate Hour Angles 


We attempt to assign the range of LST that each spec- 
troscopic plug plate can be observed in a manner that 
covers the available time allocation for BOSS with the 
highest SNR for the whole survey. We calculate the 
Galactic extinction at the center of each tile. We then 
determine the hour angle and corresponding airmass at 
each available timeslot shown in the right hand panel of 
Figure [4] for that tile. At this stage, we impose several 
constraints on the observations. Tiles within 6+5° of the 
APO latitude of +32°47’ must be designed at |HA| > 1 
hour to prevent zenith crossing, where telescope tracking 
in altitude-azimuth becomes uncertain. We then define 
a maximum hour angle for each tile to ensure that plates 
are not observed at an airmass that would lead to visi- 
bility windows that are significantly less than two hours. 
Finally, to simplify plate design, we assign hour angles 
in increments of 20 minutes. All plates are limited to a 
design hour angle of less than +3°20™. 

Using the models for dependence on airmass and 
Galactic extinction in $5.3] we estimate the effective 
exposure time required to complete each plate at each 
timeslot. We define the weight of each plate as the frac- 
tional increase in exposure time required to achieve a 
SNR? identical to a plate at zero Galactic extinction, 
airmass unity, mean seeing conditions and mean atmo- 
spheric conditions. We then rank the plates in order of 
increasing weight at each LST and determine the most 
expensive timeslots to perform observations. The regions 
that are the most expensive are those that have the high- 
est weight for the top-ranked plates. These times occur 
during the Galactic plane crossing at ~ 4.6 hours and 
~ 20 hours LST, where the fields tend to be observed at 
high airmass. 

Given the celestial boundaries in the survey, we assign 
hour angles to plates in the NGC and SGC indepen- 
dently. We start with plates in the NGC with the times- 
lot at 4.6 hours LST and the N highest ranked plates 
for the N days that are predicted to have observations 
at that LST. We simulate observations of each plate us- 
ing an integral number of 15 minute exposures, a 7596 
observing efficiency, and a SNR that evolves according 
to the change in airmass over the course of the observa- 
tion. The typical simulated observation of a plate with 
a ~ 7.5 hours takes between 1.5 and 2 hours. It is these 
plates that are observed at high hour angle around 4.6-6 
hours LST. We subtract the integrated exposure time for 
each plate over the range of LST in the same manner as 
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actual observations would take place. We similarly sim- 
ulate observations for the plates with a ~ 17 hours that 
are needed for observation at the 20 hour LST timeslot. 
We alternate between the eastern and western edges of 
the NGC, incrementally using plates closer to the center 
of the NGC as the LST approaches 12-13 hours, until all 
of the available time is used. We then assign hour angles 
to plates in the SGC following the same technique. 


5.5. Determining Exposure Depths and Updating Plate 
Designs 


We next evaluate the amount of time required to com- 
plete the plates to determine how much additional signal 
beyond the minimum thresholds (SNR > 10, SNR? > 
20) can be acquired while still completing the survey. 
The process of hour angle assignment is performed iter- 
atively by varying the minimum depth of each exposure 
and varying the LST range covered by the NGC and SGC 
plates. Simulated fifteen minute exposures are accumu- 
lated until the SNR? exceeds some minimum threshold 
as described in §4.2] We manually vary the thresholds 
of both the blue and red cameras until the simulations 
produce a completed survey in the amount of time al- 
located to BOSS. We adjust the LST times that divide 
plates between the NGC and SGC until both regions are 
observed in the amount of time allocated; plates from the 
NGC should be observed between an LST of 4.9 hours 
and an LST of 19.1 hours while plates from the SGC 
should be observed at other times. 

We find that we can complete the survey if we set a 
SNR? threshold of 22 per pixel for the synthetic i—band 
filter and 10 per pixel for the synthetic g—band filter at 
their respective fiducial fiber2 magnitudes. As argued in 
this depth is sufficient to accurately obtain redshifts 
for the vast majority of CMASS galaxy targets. The 
mean exposure time for the survey will be 1.61 hours per 
plate, amounting to a total of 3551 hours of exposure 
time. Because the plates in the NGC are located at a 
higher Declination on average, and in regions that have 
smaller amounts of Galactic dust, the typical plate in the 
NGC will be completed in 1.49 hours, while the average 
plate in the SGC will be completed in 1.92 hours. The 
projected time spent observing at each LST is shown in 
Figure[4]and a histogram of predicted plate SNR? for the 
full survey are shown in Figure[5] 

Figure 4| reveals two additional features of the BOSS 
survey projections. The first feature is a minor short- 
age of plates in the LST regions covered by the Galactic 
plane, appearing as dips around 5 and 20 hours. It is dur- 
ing these times that the observations are affected by the 
limitation on hour angles mentioned above. This short- 
age of plates was actually exacerbated during the first 
two years of observations. The second feature appears as 
an excess of plates between 21 and 3 hours LST. This is 
the part of the SGC that is visible in the summer months 
and early Fall when nights are short and when the tele- 
scope is closed for six weeks due to yearly maintenance. 
It is likely that BOSS will have unobserved plates in this 
region when the survey is complete. To minimize similar 
gaps at other ranges of LST, we update the survey pro- 
jections every few months to account for the amount of 
time remaining in the survey and the plates completed. 


5.6. Maintaining a High Observing Efficiency 


We require a high observing efficiency to accomplish 
the goal of completing the survey footprint within a fixed 
time window that is set largely by funding constraints. 
'To do so, we balance nightly observing efficiency against 
the higher priority of protecting the telescope and instru- 
ments to avoid down-time or catastrophic damage. Over 
the years since the beginning of SDSS, we have adopted 
a number of strategies to achieve this balance. 

We assign two night-time observers at all times as dis- 
cussed in $4.2] Two observers are scheduled mainly for 
observing efficiency and safety. The “warm” observer is 
responsible for setting up the software that controls the 
telescope and instruments, starting and monitoring data 
collection, checking SOS feedback, and other tasks that 
can be performed indoors. The “cold” observer’s duties 
include swapping cartridges, checking data quality and 
writing the night log. To minimize the amount of time 
between cartridge changes, the cold observer prepares 
for the change before the final exposure is completed. 
As soon as the final exposure is complete, the warm ob- 
server moves the telescope to zenith and the cold observer 
physically locks the system so that the telescope cannot 
move. As soon as the cold observer mounts the new car- 
tridge to the telescope, the warm observer updates the 
software control system to reflect the cartridge change. 
The cold observer, still outdoors, releases the telescope 
so that slewing can begin. With two observers on duty, 
we shorten each cartridge change by between five and six 
minutes (saving 2-4 exposures per night), ensure better 
data quality, and guarantee that all parts of the system 
are functioning correctly. Operations safety is another 
important reason for scheduling two observers. While 
the cold observer is performing duties outside, the warm 
observer is monitoring the process from a camera that 
feeds to video in the control room. Therefore, the sec- 
ond observer can respond quickly in the unlikely case of 
an accident. All observers are trained to perform both 
warm and cold roles to ensure continuity between observ- 
ing shift changes. 

Both observers monitor the weather conditions and de- 
termine when to close the telescope enclosure. In a nor- 
mal situation, it takes about seven minutes to close (in- 
cluding slewing the telescope to stow position and mov- 
ing the enclosure). However, our criteria for closing due 
to external conditions are somewhat conservative in case 
we must manually move the enclosure back over the tele- 
scope, which takes about 30 minutes. The observers will 
close the telescope during the night if any of the following 
conditions occur: 


e Any lightning is detected within 15 miles. 


e Precipitation is detected, or a radar return from 
precipitation is approaching that is thirty minutes 
away or less. 


e Smoke or ash is detected. 
e Wind speed is over 40 mph. 


e Humidity is high, with a dewpoint 2.5? C be- 
low ambient temperature. Observers also visually 
check for condensation any time the temperature 
drops within 4 C of dewpoint. 


e Integrated dust counts (likely from the White 
Sands gypsum dunefield or agriculture in the typ- 
ically dry valleys below the telescope) for the 
night exceeds 20,000-40,000 count-hours. The ex- 
act number depends on humidity, closing at lower 
integrated dust-hours during high humidity. 


e Ambient temperature is below -12 C. 


At the end of each night, logs are distributed to a 
broad list of APO day staff, observers, software develop- 
ers, and project leaders (including those for MARVELS, 
APOGEE, and SEGUE). These logs are partly generated 
automatically, covering most of the telescope, weather 
and instrument status. Any problems or unusual situa- 
tions are recorded manually by the observers, including 
relevant error messages and as much relevant information 
as is available. 
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6. DATA QUALITY 


The final imaging observations of the BOSS footprint 
were completed in the Fall of 2009 and are found in 
DR8 (Aihara et al]201la). Following the spectrograph 
rebuild in Summer of 2009, BOSS commissioning took 
place in Fall 2009. During commissioning, spectroscopic 
observations were performed on nights assigned to BOSS 
when conditions were not photometric. Full spectro- 
scopic survey operations began on December 5, 2009 
(MJD 55170) after the commissioning phase was com- 
plete, marking the first spectroscopic data that are in- 
cluded in DR9. Several additional improvements were 
made to the spectrograph during the survey. A few bugs 
in the software for guiding were resolved, residual tilt 
in the CCD focal plane was corrected, new triplet lenses 
were installed for all four cameras, the red CCDs were re- 
placed, and the collimator mirrors were recoated. These 
changes led to improved spectral resolution and through- 
put, with a combined improvement to survey efficiency of 
roughly 25%. The dates of these changes are documented 
in [Ahn et all (013); (2013) 

To give the reader a qualitative impression of the BOSS 
data quality from these first two years of data, we present 
a few examples of galaxy, quasar, and stellar spectra in 
Figure[6] As with SDSS, the enormous size of the BOSS 
spectroscopic sample also includes many classes of rare 
objects. A few examples of unusual spectra are shown 
in Figure[/| In the following section, we describe some 
global characteristics of the BOSS spectra. 
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Fic. 6.— A mosaic of BOSS spectra (black) and best-fit idlspec2d templates (red) of varying luminosity. The plate, MJD, fiber, and 
fiber2 magnitude are listed at the top of each panel. Left (Top Three Panels): Galaxy spectra from the LOWZ and CMASS samples 
in order of decreasing luminosity and increasing redshift: a z = 0.238 LOWZ galaxy, a z = 0.541 CMASS galaxy, and a z = 0.695 CMASS 
galaxy that is slightly fainter than the igp2 < 21.5 threshold that was imposed after year one. In each case, the top spectrum is offset by 
2.5 x 10777 erg s^! cm 2 À-1 and the bottom spectrum is smoothed with a five-pixel median boxcar filter. Right (Top Three Panels): 
Unsmoothed quasar spectra from the CORE and BONUS samples in order of increasing redshift: a z = 2.57 quasar, a z = 3.14 quasar, 
and a z = 3.53 quasar. Bottom: Unsmoothed spectrum of a standard star (Left) and of a white dwarf star (Right). 
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FIG. 7.— A mosaic of example spectra of rare objects discovered in the BOSS spectroscopic sample. Top: Spectrum of a carbon star 
(left) and a cataclysmic variable with strong emission lines (right). Bottom: Spectrum of a post-starburst galaxy at z = 0.95 with AGN 
activity (left) and a featureless BL Lac imprinted with absorption lines from Fe II A2382, Fe II A2600, and the Mg II A2798,2803 doublet 
at z — 0.67 (right). The idlspec2d template (red) for the post-starburst galaxy is a good fit to the data. For this galaxy, the raw data is 


displayed with an offset of 2.5 x 10717 erg s 


true flux density. 


1 


cm~2A~! and the spectrum smoothed with a five-pixel median boxcar filter is shown at 
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6.1. BOSS Spectrophotometry 


As described briefly in $4.3] the flux calibration for 
each exposure is corrected as a function of position in 
the focal plane using PSF magnitudes of standard stars 
as a baseline. Centroiding errors, extended objects, guid- 
ing errors, transparency variations and other uncertain- 
ties in the corrections can lead to spectrophotometric 
errors above the measurement uncertainty over the 3° 
field. As explained in (2004), an anal- 
ysis of magnitudes synthesized from SDSS galaxy spec- 
tra showed an RMS dispersion of 5% in (g — r) and 3% 
in (r — i) relative to the colors measured from 3" fiber 
magnitudes from SDSS photometry. At the bluest wave- 
lengths (3800 A), the error was closer to 1296. Comparing 
stars with a PSF magnitude brighter than 19, the SDSS 
spectrophotometry was biased 0.02 magnitudes brighter 
with an RMS dispersion of 0.05 magnitudes in r (DR6; 
[Adeliman-MeCarth et all Z008). Similarly, the (g — r) 
colors showed a bias of 0.02 magnitudes with 0.05 mag- 
nitude dispersion while the (r — i) colors showed a bias 
of -0.01 magnitudes with 0.03 magnitude dispersion. 

We performed a similar analysis on the flux calibra- 
tion of BOSS spectra using objects with 15 < gfp2 < 19. 
As shown in Figure [8] we find slightly larger bias and 
RMS dispersion in fluxing errors for stars and galaxies 
than was reported in SDSS. The larger fluxing errors are 
not surprising because the BOSS fibers are smaller in di- 
ameter and therefore more susceptible to guiding offsets. 
For the standard stars we find BOSS spectrophotome- 
try to be on average 0.014 magnitudes fainter than the 
PSF photometry with an RMS dispersion of 0.058 mag- 
nitudes in r. BOSS spectrophotometry is 0.038 magni- 
tudes fainter (0.068 magnitude dispersion) than the PSF 
photometry in g. The (g — r) colors are 0.022 mag- 
nitudes redder (0.063 magnitude dispersion) while the 
(r — i) colors are 0.004 magnitudes bluer (0.035 magni- 
tude dispersion). For galaxies, we compare only colors, 
using the 2" fiber magnitudes from SDSS imaging, and 
find that (g — r) colors are 0.048 (0.058 magnitude dis- 
persion) and (r — i) colors are 0.013 magnitudes (0.035 
magnitude dispersion) redder for the spectra. As shown 
in the same figure, the offsets and dispersion of spec- 
tra are much larger for objects targeted as quasars but 
confirmed to be stars. On average, stellar contaminants 
in the quasar sample are 0.16 magnitudes fainter in r, 
with an RMS dispersion of 0.29 magnitudes; these ob- 
jects are 0.038 (0.158 magnitude dispersion) and 0.070 
(0.099 magnitude dispersion) magnitudes more blue in 
g —r and r — i colors respectively. 

'The application of washers and the offsets in plate po- 
sition for the quasar fibers accounts for the warping of 
the quasar target spectrophotometry relative to the stan- 
dard stars. The quasars have higher throughput in the 
blue relative to the standard stars at a cost of reduced 
throughput in the red, and this difference is not included 
in the flux calibration. The effect is exacerbated by the 
smaller fibers (now comparable in diameter to the typ- 
ical 1.5" seeing at Apache Point) relative to those used 
in SDSS. 


'The net effects of these sources of error can be demon- 
strated in the spectrophotometry of plates 3615 and 
3647, both located on the celestial equator at a = 37°. 
The two plates contain identical targets but were drilled 
for different airmass and were observed a total of nine 
times in the first two years to monitor the system. An 
example of the spectrophotometric calibration accuracy 
from these two plates is shown in Figure [9] The left 
panel displays a quasar observed twice at different air- 
mass. Note the clear trend toward bluer synthetic pho- 
tometry in the spectrum that was obtained at larger air- 
mass; ADR causes the red light to be offset from the 
center of the fiber for the quasar target in an opposite 
sense to the ADR effect on the spectrophotometric star 
fibers (optimized for throughput at 5400 A). Also shown 
in the figure is the spectrum of a star that was targeted 
as a likely quasar and therefore positioned in the offset 
quasar focal plane. The stellar profile is known to be 
constant over the period of 29 days between the two ob- 
servations, yet the spectra show the same trend toward 
bluer colors. 

An understanding of spectrophotometric accuracy is 
critical for studies of cosmology, galaxy evolution, and 
quasar physics. evaluates small wavelength- 
scale residuals in the flux calibration that can con- 
taminate weak emission and absorption features in the 
spectra. Using SDSS spectra, he estimates that the 
wavelength-dependent relative flux calibration is accu- 
rate at the 1-2% level. A similar analysis has not been 
published with BOSS spectra, but the similarities in the 
data reduction pipeline make it likely that the BOSS 
spectrophotometry has similar accuracy on small wave- 
length scales. The broad band spectrophotometry ac- 
curacy described here is biased at less than 596 for the 
standard stars and galaxy targets, with a comparable 
amount of scatter. This precision does not seem to sig- 
nificantly reduce the galaxy redshift efficiency; as was 
shown in the igy2 < 21.5 CMASS spectra are suc- 
cessfully classified for more than 94% of the objects. The 
estimates of color scatter and bias described in this sec- 
tion should be taken into consideration for science ap- 
plications that require precise spectrophotometry; re- 
peat spectra can be used to verify the reproducibility 
of measurements. In general, galaxy evolution studies 
should use the 2" fiber magnitudes for comparison of 
spectrophotometric color to photometry, as the central 
region of extended galaxies contributes the majority of 
light to the BOSS spectra. The intrinsic quasar contin- 
uum must be estimated before Lya forest analysis and is 
highly susceptible to spectrophotometric errors because 
of the fiber offsets. The Lya analysis of 
effectively treats the broadband power introduced 
by spectrophotometric errors as a nuisance parameter in 
continuum fitting. The continuum fitting procedure out- 
lined [Eee et ai] has been applied to the BOSS 
data, and will be described in a public release of model 
quasar continuum 
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Fic. 8.— The difference between the synthetic photometry computed from each spectrum and the measured photometry from the SDSS 
imaging data. Left: Histogram of the spectrophotometric offsets for stars. The black line shows gsynthetic — gpsr for the standard stars 
(top) and for a sample of stars that appeared as contaminants in the CORE and BONUS quasar samples (bottom). Similarly, the 
blue lines show the offsets for the r filter while the red lines show the offsets for the i filter. Right: Histogram of the color differences 
(g — r)synthetic — (g — r)Psr shown in the top panel and r — i in the bottom panel. Standard stars are presented as the blue line using PSF 
magnitudes, galaxies as the black line (fiber2 magnitudes), and stellar contaminants in the quasar sample as the red line (PSF magnitudes). 
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Fic. 9.— Quasar target spectra smoothed with a five-pixel median boxcar filter, covering the wavelength range 3500 A through 6500 A 
taken at different airmass, demonstrating the effects of atmospheric differential refraction and other guiding errors on spectrophotometry. 
Left: Spectrum of a ggp2 = 19.48 quasar at z = 2.49 observed at an airmass of 1.2 (black) and again at an airmass of 1.4 (blue). Right: 
Spectrum of a white dwarf with ggy2 = 21.53 on the same observations with the same color pattern. 
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6.2. Galaxy Spectroscopic Data Quality 


perform a spectroscopic analysis 
of the 492,450 galaxy spectra that are part of DR9. They 
show that the typical signal-to-noise ratio of BOSS spec- 
tra is sufficient to measure simple dynamical quantities 
such as stellar velocity dispersion for individual objects. 
'The typical error in the velocity dispersion measurement 
is 14%, and 93% of BOSS galaxies have velocity disper- 
a with an accuracy better than 30%. 
further show that emission lines can be measured 
on a spectra, but the majority of BOSS galaxies lack 
detectable emission lines, as is to be expected because of 
the target selection design toward massive galaxies. 

In order to independently assess the performance of 
BOSS idlspec2d in determining galaxy redshifts, the 
code *RUNZ T] was run on all extracted galaxy spectra. 
Comparisons between the RUNZ and idlspec2d redshifts 
gave important indications of common failure modes in 
the idlspec2d redshifts at the beginning of the survey. 
'The results from RUNZ will be made available with DR9. 

RUNZ was _ originally developed for 2dFGRS 


(Colless et al] [2001] and has since been optimized 


for MEN LRG reden surveys such as the 28LAQ Sur- 
vey (Cannon et al] [2008] and the AAOmega UKIDSS 
T (AUS) LRG Survey (Wake et al. in prep). 
The el has a been used to determine red- 
shifts for emission mu alaxies in the WiggleZ survey 


(Drinkwater et al] DOO and for a broad range of 
galaxy types in the Galaxy and Mass Assembly survey 
OAN [Driver etal DORA BOLT). RUNZ provides 
s redshift estimates for each input spectrum; the 
first by cross-correlating each spectrum with a series 
of galaxy and stellar templates (with the emission 
lines masked) and the second by fitting a Gaussian to 
detected emission lines and identifying multiple matches 
at a single redshift. An integer value (q) between one 
and five is assigned to reflect the quality of the redshift. 
RUNZ was modified for BOSS and tuned to assign 
quality values and input templates based on the visual 
inspection of one of the commissioning plates. 

Selecting all repeat spectra, we identify the primary 
spectrum (also requiring q > 3) and test the reliability 
of the RUNZ quality flags and redshifts assigned to the 
other spectra of the same object. We attempted to tune 
the quality value q = 3 to correspond to a 95% relia- 
bility and q = 4 to correspond to a > 99% reliability. 
Table [5] shows the fraction of repeat spectra pairs that 
yield the same redshifts, where agreement is defined as 
a velocity difference dv = (z1 — 22)/(1+ 22) *« c < 500 
km s~!. Analogous statistics are reported in Table [5] for 
the idlspec2d redshifts, where a reliable redshift is desig- 
nated as having ZWARNING = = 0. The results agree 
with the findings of Ross et al. [Ross et al] (20124 , who report that 
LOWZ and CMASS galaxies that were observed multi- 
ple times produced idlspec2d redshifts with a deviation 
6z/(1+z) < 0.001 in 99.7% of the cases when the spectro- 
scopic classification produced ZWARNING = 0. Vi- 
sual inspection of a subsample of those data confirmed 
this result, indicating that the rate of catastrophic fail- 
ures among the galaxy targets is less than 1%, as required 
to meet the BAO requirements. 
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TABLE 5 
REDSHIFT CONSISTENCY OF REPEAT SPECTRA 


Condition Condition Number Consistency 
of Object 1 of Object 2 of Pairs Fraction 
q<2 q>3 448 0.58 
q=3 q>3 1298 0.93 
q>3 q>3 24075 0.996 
q23 q>3 25373 0.992 
ZWARN = 0 ZWARN = =0 27304 0.996 


ZWARN >0 ZWARN = 0 771 0.68 


We estimated the redshift accuracy for RUNZ and for 
idlspec2d by using the same repeat spectra and requir- 
ing all pairs to have q > 2 (or ZWARNING = 0). The 
distribution of óv for RUNZ and BOSS are shown in the 
bottom panel of Figure[10] The idlspec2d redshifts are 
significantly more repeatable, indicating a single-epoch 
RMS uncertainty of 38 km s^! compared to 81 km s^! for 
the RUNZ redshifts (after removing pairs with dv > 2000 
km s71). The bottom panel of Figure shows the 
dependence of the RMS in óv on igpe. The idlspec2d 
pipeline now performs better at all magnitudes, contin- 
ues to improve at bright magnitudes, and grows more 
slowly with decreasing flux when compared to the RUNZ 
results. 
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Fic. 10.— Statistics of dv in pairs of spectra for the same galaxy. 
Top: Histogram of óv for RUNZ (solid black) and for idlspec2d 
(dashed). Bottom: RMS width of distribution binned at half 
magnitude intervals in igp3. RUNZ is presented as solid circles 
while idlspec2d is presented as open circles. 
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Fic. 11.— Examples of spectra that produced discrepant redshifts between the RUNZ analysis and the idlspec2d analysis due to a 
superposition of multiple objects. The RUNZ fit is shown in the top panels and the idlspec2d fit in the bottom panels. Three spectra are 
plotted in each panel, from top to bottom: raw BOSS spectrum, BOSS spectrum smoothed with a 7-pixel boxcar filter, and best-fit spectral 
template. Typical line features are identified at the top of each panel by name, flux density is presented in arbitrary units. As before, the 
plate, MJD, fiber combination is presented at the top of each panel. Left: Example spectrum where the RUNZ redshift corresponds to a 
foreground passive galaxy and the idlspec2d redshift corresponds to a background emission-line galaxy. Right: Example spectrum where 
the RUNZ redshift corresponds to a background passive galaxy and the idlspec2d redshift corresponds to a foreground emission-line galaxy. 
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Fic. 12.— Examples of spectra that produced discrepant redshifts between the RUNZ analysis and the idlspec2d analysis where one is 
incorrect. The labels are the same as those in the previous figure. Left: Example spectrum where manual classification reveals the RUNZ 
redshift to be correct and the idlspec2d redshift to be incorrect. The spectral distortion around 6000 A is likely due to poor calibration 
in the dichroic region, a rare occurrence in BOSS spectra. Right: Example spectrum where manual classification reveals the idlspec2d 
redshift to be correct and the RUNZ redshift to be incorrect. 


When comparing the results of RUNZ directly to 
the idlspec2d pipeline for identical spectra, we find the 
RUNZ redshifts are lower by a median of 8 km s^!. The 
RMS scatter between the two algorithms is 72 km s7}, 
consistent with the larger RMS scatter observed in the 
repeat spectra of RUNZ redshifts. Possible redshift fail- 
ure modes are identified by comparing instances where 
both idlspec2d and RUNZ indicate a correct redshift for a 
galaxy but the redshifts differ significantly. There are 311 
such galaxies in DR9 where ZWARNING= 0, q > 3 
and (ZmuNz — Zidlspec2a)/ (1 + Zialspec2a) * € > 1000 km 
s71. Visual inspection reveals a number of causes for 
the failures. In many cases, both redshifts are valid be- 
cause a superposition of two different objects at different 
redshifts leads to two correct results; two such exam- 
ples are presented in Figure [I] In the case of a star- 
galaxy superposition, the idlspec2d pipeline mildly favors 
a redshift assigned to the stars compared to RUNZ. How- 
ever, in the case that background emission-line objects 
are projected on foreground galaxy light, idlspec2d tends 
to select the background emission-line redshift whereas 
RUNZ selects the foreground redshift. A genuine fail- 
ure mode for idlspec2d appears in instances of bad data, 
either where the relative normalization of the red and 
blue spectra has failed or some broad band shape has 
been introduced into the blue spectra. An example of 
such a spectrum is shown in the left hand panel of Fig- 
ure[12] RUNZ appears to be less affected by such prob- 
lems, as it doesn't fit the broad band shape of the spec- 
tra. The most common failure for RUNZ occurs in strong 
emission-line galaxies. These failures may result from in- 
adequate templates combined with an insufficient weight 
given to the emission-line redshift relative to the cross- 
correlation redshift as a result of the LRG optimization 
of this version of RUNZ. 

While RUNZ was used to help diagnose BOSS galaxy 
redshifts early in the survey, we concentrated our sub- 
sequent software development entirely on the idlspec2d 
data reduction pipeline and redshift classification. The 
algorithms employed for redshift determination of BOSS 
targets are described in [Bolton et al] (20121), including 
a more thorough description of templates and redshift ef- 
ficiency than what is found here. Even though idlspec2d 
performs slightly better than RUNZ in terms of com- 
pleteness, reliability, and redshift repeatability, RUNZ 
would probably perform better if it were fully optimized 
for the BOSS data. 


6.3. Quasar Classification 


Due to the variety of quasar spectral features and the 
complexity of automatic classification, members of the 
French Participation Group supplemented the idlspec2d 
results with a visual inspection of the spectra. The visual 
inspection provides a secure identification and a reliable 
estimate of the redshift for each object in addition to a 
characterization of other fundamental quasar properties. 
In total, 189,018 spectra were inspected in the DR9 sam- 
ple. We provide a brief summary of the inspections; full 
details sed found in (013). 

All objects that were targeted as quasars or classified as 
quasars with z > 2 by idlspec2d were visually inspected 
and added to the DR9 quasar catalog (DR9Q; 
[2012), which is included with DR9. The inspections in- 
clude not only CORE and BONUS quasars, but also 
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TABLE 6 
KEY STATISTICS FROM DR9Q 


Classification # Visually Inspected Fraction 
Quasar 87,822 1.00 
Quasar with z > 2.15 61,933 0.70 
BAL Quasar 7,032 0.086 
DLA Quasar 7,492 0.085 


objects from ancillary programs focused on quasars. A 
fraction of objects in the galaxy sample were identified as 
quasars by idlspec2d and visually inspected as well. The 
catalog provides human classifications of the objects, re- 
fined redshift estimates and emission line characteristics. 
Peculiar spectral features that could affect any Lyo for- 
est analysis, such as damped Lya (DLA) systems and 
broad absorption line (BAL) quasars, are first flagged in 
visual inspection. DLA systems are then evaluated with 
an automated characterization to determine column den- 
sities (Noterdaeme et al][2009). Those results will be re- 
ported in a separate catalog (Noterdaeme et al] [2012]. 
BAL quasars are processed for automatic estimates of 
the balnicity index (BI; and ab- 
sorption index (AI; of C IV troughs to 
quantify their strength. Problems identified in visual in- 
spection such as the presence of artificial breaks in the 
spectrum, poor flux calibration or bad sky subtraction 
are flagged as well. 

The visual inspection starts from the output of idl- 
spec2d for both classification and redshift. We find 
that fewer than 0.396 of objects classified as quasars 
with ZWARNING = 0 by idlspec2d have misidentified 
the observed emission lines (typically at z « 2, where 
Lya emission is not observable), leading to redshift er- 
rors Az > 0.1. About half of the objects classified as 
quasars by idlspec2d have redshifts adjusted by less than 
Az > 0.1 (Az < 0.005) after their visual inspection, 
when the best fit idlspec2d template misses the position 
of the Mg II emission line peak or when the maximum 
of the C IV emission line determines the redshift. The 
precision of quasar redshifts are further refined using a 
linear combination of carefully crafted principal compo- 
nents fit to each spectrum (see, e.g., [Paris et al][2011). 
Indeed, known shifts between emission lines are intrinsi- 
cally imprinted in the eigenvectors and this method also 
takes into account the quasar-to-quasar variation. This 
redshift estimate is also provided in addition to the idl- 
spec2d results as part of the DR9Q. 

About 12% of the quasar targets in the CORE and 
BONUS samples are assigned a non-zero ZWARN- 
ING flag in the idlspec2d pipeline. Visual inspection 
reveals that 13% of these objects are truly quasars and 
about 7% have z > 2.15. Of those objects that are clas- 
sified with ZWARNING = 0 as quasars in idlspec2d, 
approximately 1% are revealed to be stars. Similarly, 
only 23 objects classified as stars with ZWARNING 
= 0 are shown to be quasars in visual inspection, only 
five of which are z > 2.15 quasars. A summary of the 
classifications in DR9Q is provided in Table [6] 


6.4. Lyman a Forest Measurements 
As described in $5.2] the exposure depths over the full 


survey are sufficient to ensure a nearly 100% redshift 
completeness among true quasars. [McQuinn & White 
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Fic. 13.— Statistics of the depth of the spectra in the Lyo forest region 1041 A to 1185 A (rest frame). Only quasars at z > 2.5 are 
included, so that the entire Lya forest lies within BOSS wavelength coverage. On average, the sampling is bit more than 1 A pixel-! in 
this subsample. Left: Mean SNR per pixel as a function of extinction-corrected gpsp magnitude. The error bars represent the standard 
deviation of all quasars in each bin of width 0.1 magnitudes. Right: Distribution of the SNR per pixel. The solid line is taken from a 
sample of quasars near the bright end (gpgr ~ 20) of the sample. The dotted line represents a typical quasar (gesr ~ 21). The dashed 
line shows quasars near the faint magnitude limit gpgp ~ 22.0 of the CORE and BONUS target selection algorithms. 


also address the question of a survey's sensitiv- 
ity in the Lya forest using a metric vn. The parameter 
Vn ranges from zero to one and represents the effective 
value of each quasar to constraining the 3D power spec- 
trum depending on the redshift and SNR in the Lya 


forest region. Assuming the quasar redshift and luminos- 
ity distribution offer e al] (2006), they find that 
Vn roughly doubles as SNR per A increases from one to 
two for a quasar at z ~ 3. Similar to 
(2012), they argue for a survey that favors area over 
depth, citing specifically that the improvement in sen- 
sitivity is marginal once the noise exceeds SNR= 2 per 
A for the faintest quasars. The specific Lya data quality 
modeled in these projections lends themselves to a direct 
comparison to the BOSS data obtained in the first two 
years of the survey. 

Using the DR9 quasar sample, we evaluate the depth 
of the spectra in the Lya forest for comparison to the 
(2011) projections. Using only ob- 
jects that were classified as quasars by idlspec2d with 
ZWARNING= 0, we compute the SNR per coadded 
BOSS pixel between 1041 À and 1185 À in the rest frame 
of each quasar. In the co-added spectra, the sampling at 
4300 À corresponds to roughly 1 À per pixel, leading 
to a sampling that is easily compared to the projections 
of[McQuinn iz Whitd (2011). The sampling of the Lya 
forest is somewhat higher for quasars at z = 2.15 and 
slightly lower for quasars at z = 3.5. The SNR statistics 
of the Lya forest pixels are shown in Figure 'The 
typical quasar with gpsp < 21.5 exceeds SNR= 2 per 
pixel while the fainter quasars exceed SNR= 1 on aver- 
age, leading to values vn = 0.5 and vn = 0.2 at z = 2.6 
for the two samples respectively. The target density and 
survey depth results in a SNR that is sufficient for Lya 
constraints on the BAO peak. 


T. CONCLUSION 


The data from the 2009-2010 and the 2010-2011 BOSS 
observing seasons are included in DR9. Observations 
followed a pace that was close to the survey projections; 


236 unique plates were completed in first year compared 
to an expectation of 331. While this rate was about 
3096 behind the required rate to complete the survey, 
the slow start can largely be explained by the deeper ex- 
posures that were acquired. The SNR? thresholds were 
reduced by 37.596 on the blue cameras and 1596 on the 
red cameras after that first year; 577 unique plates were 
completed in the second year, compared to an expecta- 
tion of 454. The rate of plate completion benefited from 
better than average weather in the spring of 2011. In to- 
tal, 813 unique plates have been completed, 3.596 ahead 
of the projected progress to complete the survey by the 
summer of 2014. As reported in Anderson et ai] (2013), 
these plates account for 3275 deg? of unique sky cover- 
age, approximately 3596 of the full footprint. The rate 
of plate completion as a function of time during the first 
two years of BOSS is shown in Figure [I4] 

'The plates completed as a function of equatorial coor- 
dinates are presented in Figure [I] This figure reveals a 
high rate of completion in the tGC and a lower rate in the 
SGC. The difference between the NGC and SGC is due to 
the unusually good weather in the Spring of 2011 and the 
fact that commissioning occurred during most of the time 
that the SGC was visible in 2009. The difference is more 
acute when comparing the 110? < a « 130? region of the 
NGC to the a > 330° region of the SGC. As explained 
in $5.4] the Galactic plane crosses at LST ~ 4.6 hours, 
and many of those NGC fields must be observed near 
that time. As shown in Figure [5] the shorter nights and 
New Mexico monsoon season result in a smaller amount 
of observing time available to observe those SGC plates. 
Looking beyond the second year and assuming average 
weather, we predict that we will complete the NGC re- 
gion of the survey in less than the allotted time, but 
will likely need better than average weather to finish the 
a > 330° region of the SGC. 

The DR9 sample is the first sample of BOSS spectra 
released to the public and is the data that define the ini- 
tial sample for the BOSS galaxy and Lya BAO analyses 
described below. DR9 contains 324,198 unique CMASS 
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Fic. 14.— Left: Progress of the survey at the time of DR9. The solid curve represents the number of unique plates completed as a 
function of time in the first two years of BOSS. The dashed curve represents the projected rate required to complete the baseline survey 
of 2208 plates. The yearly “pauses” are the result of the summer shutdown for telescope maintenance during the New Mexico monsoon 
season. The plate completion thresholds were changed in the summer of 2010 from SNR? = 26 to SNR? = 22 and from SNR2, — 16 


to SNR% = 10. Center: Average number density as a function of redshift for the SDSS (dash-dotted), LOWZ (dotted), and CMASS 


DR9 galaxies with ZWARNING_NOQSO= 0. Right: Redshift distribution of DR7 (dotted), DR9 (solid), and unique DR9 (dashed) 


quasars. 


TABLE 7 
DR9 REDSHIFT DETERMINATION FOR MAIN TARGETS 


Sample Unique Targets Repeat Observations 
LOWZ 103,729 7,646 
CMASS 324,198 29,493 
Quasar 154,433 11,601 


targets and 29,493 repeat observations from the CMASS 
sample. Exceeding survey requirements, the spectra 
from 98.7% of the CMASS targets produced ZWARN- 
ING_NOQSO = 0 in at least one of the observations, 
and 95.4% were confirmed as galaxies. Contamination 
from stars accounts for the 3.3% of successfully classified 
objects that were not galaxies. As expected, the statis- 
tics from the LOWZ sample are even better; 103,729 
unique targets (7,646 repeat observations) produced a 
99.9% successful rate of classification; 99.2% of LOWZ 
targets were successfully classified as galaxies. 

There were 154,433 unique quasar targets from the 
main sample, with 11,601 repeat observations. The idl- 
spec2d pipeline classified 79.2% of these objects success- 
fully and determined 51.5% of the 154,433 objects to be 
quasars. The numbers are similar to the numbers found 
in manual classification described in with minor dif- 
ferences explained by the inclusion of ancillary programs 
in the manual inspections and occasional disagreement 
between the two techniques. A summary of the DR9 
statistics is found in Table[7]and the redshift distribution 
for all successfully classified main galaxies and quasars is 
shown in Figure [I4] 


7.1. BOSS in the Context of Previous SDSS 
Spectroscopy 


As discussed in dI] the improvements to the BOSS 
spectrograph and increased cosmological volume of 
BOSS produce a new sample that is both complementary 
and a significant expansion beyond SDSS. The SDSS-I 
and II surveys obtained roughly 1.8 million spectra of 
galaxies, stars, and quasars (DR8: Aihara et _alJ[2011a). 
There are now about as many galaxies in the BOSS 
LOWZ sample as in the LRG sample used to derive the 
BAO constraints from SDSS, while the sample of CMASS 


Failed Redshifts Purity — Contaminating Objects 
119 102,890 120 
4167 309,307 10724 
31945 79,570 42918 


galaxies represents a probe of an entirely new cosmolog- 
ical volume. The SDSS sample consists of more than 
100,000 quasar spectra, with a median redshift of z — 1.5 
and absolute magnitudes M; < —22.0 


[2010). Just over 19,000 of these quasars lie beyond 


z = 2.15; the new combined sample of Lya quasars is 
already a factor 3.7 larger than what was available with 
SDSS. 

BOSS produces more objects at high redshift largely 
due to the improved throughout of the spectrographs and 
the decreased sky background from the smaller fibers. 
The improved sensitivity is demonstrated through a com- 
parison of the typical SNR in BOSS spectra relative to 
SDSS. By comparing the quality of data from a random 
sampling of stellar point sources from both samples, we 
find roughly a factor of two improvement in SNR per 
pixel in BOSS. A SNR vs. magnitude scatter plot of 
individual objects measured in the synthetic bandpass 
filters griz is shown in Figure[15} We do not include the 
synthetic u filter because of the significant increase in the 
wavelength coverage of BOSS in that wavelength range. 

The ambitious extension to redshifts beyond z = 0.6 
for the CMASS sample was made possible not only by im- 
provements to the spectrographs, but also by the proven 
reliability of the idlspec2D pipeline to classify objects 
with spectra as low as SNR~ 2 per coadded pixel in 
SDSS. An example of a successful classification of a faint 
SDSS LRG is shown in Figure [16] For comparison, a 
CMASS galaxy with reliable redshift is shown in the 
same figure. The BOSS galaxy has about the same SNR, 
but has almost 1.5 magnitudes less flux entering the fiber. 

As important as increasing the number of Lya quasars 
is the superior wavelength coverage of BOSS relative to 
SDSS. The expanded coverage from 2 3780 A to A Z 


3615 A makes the redshift range 2.15 < z « 2.3 more 
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Fic. 15.— SNR per pixel as a function of extinction-corrected PSF magnitude in each of the four synthetic bandpass filters (griz) 
matching the photometric passbands of the SDSS imaging survey. In each case, the red dots represent BOSS data while the black dots 
represent SDSS data. The data were selected from a random sampling of objects classified as stars over the full SDSS and BOSS samples. 
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Fic. 16.— Left: Comparison of the BOSS spectrum of one of the faintest CMASS targets (z = 0.694, igp2 = 21.42, mean(SNR/pixel)= 
1.82) to the SDSS spectrum of one of the faintest LRG targets (z = 0.395, igh = 19.94, corresponding to 3” fiber, mean(SNR/pixel)— 2.38). 
From top to bottom, with offsets of 8, 6, 2, and 0 x10-!7 erg s^! em~?A~! respectively, are the raw BOSS spectrum, the BOSS spectrum 
smoothed with a five pixel boxcar median filter, the raw SDSS spectrum, and the SDSS spectrum smoothed with a five pixel boxcar median 
filter. In all cases, the red curve represents the best-fit spectral template used to determine the redshift. The plate, MJD, fiber combination 
for the two objects are displayed at the top of the panel. Right: Corresponding spectral coverage of the quasar sample in BOSS (solid 
curve) relative to SDSS (dashed curve). The histogram represents the number of objects sampled at each restframe wavelength. 


accessible for Lya forest studies. As shown in Figure[14] 
19% of the Lya quasars for BOSS lie between 2.15 < 
z « 2.3. Figure shows an increase in the number 
of observations at each wavelength in the Lya forest for 
BOSS relative to SDSS. 


7.2. Early Science Results from BOSS Data 


In addition to the technical papers summarized in 41} 
members of the SDSS-III collaboration have published a 
number of papers using the BOSS data. While cosmology 
was the primary driver, the size and quality of the data 
sample enable many other studies of galaxy evolution 


and quasar physics. 

applied principal component analy- 
sis (PCA) to the rest-frame 3700-5500 À spectral region 
to estimate the properties of the BOSS galaxies. They 
find the fraction of galaxies with recent star formation 
in the CMASS sample first declines with increasing stel- 
lar mass, but flattens above a stellar mass of 101-5 Mo 
at z ~ 0.6. This is contrast to z ~ 0.1, where the frac- 
tion of galaxies with recent star formation declines mono- 
tonically with stellar mass (right panel of Figure 16 in 
[Chen et al]D0T3). 'The PCA fits as well as galaxies prop- 
erties, such as stellar masses, stellar velocity dispersions, 
are included as part of the DR9 catalog. E (2012) 
present the velocity-dispersion probability function for 
each galaxy in the LOWZ and CMASS samples. Corre- 
lating velocity dispersion with absolute magnitude and 
redshift, they find that galaxies of constant mass exhibit 
higher intrinsic scatter in velocity dispersion at higher 
redshifts. present the first BOSS 
galaxy mass functions at redshift 0.4 < z < 0.7 using 
the BOSS spectroscopic redshifts and ugriz photometry. 
A comparison to galaxy formation models shows that 
data-derived mass functions compare well with models 
at the typical BOSS redshift (z = 0.55), even at the high 
mass end. However, the data do not show the evolution 
to zero redshift that is predicted in models of hierarchi- 
cal mass build-up. The BOSS spectroscopy and SDSS 
imaging data has also been used to study the luminosity 
functions of galaxies surrounding LRGs (Tal et a1]2015). 
The data require a multiple component fit to produce a 
luminosity function consistent with the distribution of 
central galaxies and the most luminous satellite galaxies. 
'The result stands in contrast with models that predict 
mass growth of central galaxies through major mergers. 

Many observations with the Hubble Space Telescope 
(HST) have been taken inside the BOSS footprint. These 
high spatial resolution data offer a counterpart to the 
BOSS spectra, allowing, for example, a study of the 
morphology and size of the CMASS sample of galax- 
ies (Masters ct aT] POT I). 'The study reveals that the 
majority of CMASS galaxies (74 + 6%) exhibit early- 
type morphology as intended in the target selection al- 
gorithms. An additional color cut excluding galaxies 
with g — i < 2.35 increases the population of early-type 
galaxies to > 9096, producing a subsample that is sim- 
ilar to that of the SDSS-I and II LRG galaxies. In an 
approach similar to that from the Sloan Lens Advanced 
Camera for Surveys (SLACS; 
[2009], (2012) present a 
catalog of strong galaxy-galaxy gravitational lens sys- 
tems discovered spectroscopically in the CMASS sample 
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and observed with HST. Known as the BOSS Emission- 
Line Lensing Survey (BELLS), the HST imaging and 
BOSS spectra enable a reconstruction of the mass pro- 
file of the lensing galaxies. per- 
form a joint analysis on the SLACS and BELLS samples 
and determine that galaxies tend to have steeper mass 
profiles with decreasing redshift. Unlike the the studies 
of [Tal et al] (2012), the results indicate that major dry 
mergers play a major role in the evolution of massive 
galaxies. 


Finally, highlighting the potential for multi-wavelength 
studies, the first detection (Hand et al|2012) of the kine- 
matic Sunyaev-Zel'dovich effect (Sunyaev & Zel’dovic 


was discovered in a correlation of BOSS galaxies 
with maps made by the Atacama Cosmology Telescope 
2010). This is surely the first such result 
in a stream of studies that will combine BOSS spectra 
with data from projects such as WISE, XMM, Chandra 
and Planck. 

Results from the main quasar sample are now also be- 
ginning to appear. The first such study constrains the 
two-point correlation function over the redshift range 
22<2< 2.8 (White et al] 2012). 'The redshift space 
correlation function of quasars is described by a power- 
law that requires a bias factor of b ~ 3.5 for the CORE 
sample, implying that these quasars reside in halos with a 
typical mass of ~ 101? Mo. Members of the BOSS collab- 
oration will soon release results that constrain the quasar 
luminosity function, determine a composite quasar spec- 
trum (as in , report variable 
absorption in BAL quasars, and many other studies of 
quasar physics. In addition to results from the primary 
BOSS galaxy and quasar targets, analysis of data from 
the ancillary programs described in Appendix [B] contin- 
ues and will result in an even more diverse array of find- 
ings than would be possible with the main samples alone. 


7.3. BOSS Cosmology 


As explained in {2.1} we intend to complete spec- 
troscopy with BOSS over the ~10,000 deg? footprint 
that falls at high Galactic latitude and was imaged by 
SDSS-I and II. The primary goal of the galaxy red- 
shift survey is to achieve BAO distance scale constraints 
that are close to the limit set by cosmic variance out 
to z — 0.6; the only substantial (factor-of-two) improve- 
ment possible at these redshifts would be to cover the 
remaining 37 steradians of the sky. The BOSS Lya for- 
est survey will pioneer a new method of measuring three- 
dimensional structure in the high-redshift universe and 
provide the first BAO measurements of distance and ex- 
pansion rate at z > 2. 

The first small scale clustering statistics of the LOWZ 
sample from the BOSS data are reported in|Parejko et al] 
(2012). Measurements of clustering in the CMASS 
sample from the first year of BOSS data are found in 
(2010). That analysis includes constraints 
on the halo occupation distribution. With the full DR9 
sample, the BOSS collaboration carried out a series 
of more ambitious clustering analysis. For example, 
measure the growth of structure and 
expansion rate at z — 0.57 from anisotropic clustering, 
and[Sánchez et al] explore the cosmological impli- 
cations of the large-scale two-point correlation function. 
The analysis for these cosmology studies shares much of 
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the same framework as the galaxy BAO study. Toward 
that goal, (2012) generate a large sample 


of mock galaxy catalogs to characterize the statistics of 
large scale mul present an anal- 

sis of potential systematics in the galaxy sample, and 
measure structure growth using pas- 
sive galaxies. 

The efforts described in the previous paragraph 
made possible the first BOSS BAO measurement 
(Anderson et al]|2012). Using 264,283 CMASS galaxies 
over 3275 deg? and a redshift range 0.43 < z < 0.7, we 
determined the angle-averaged galaxy correlation func- 
tion and galaxy power spectrum in an effective volume 
of more than two Gpc?, the largest sample of the Uni- 
verse ever surveyed at this density. In combination with 
the SDSS LRG sample, the BAO detection provides a 
1.7% measurement of the distance to z = 0.57, the most 
precise distance constraint ever obtained from a galaxy 
survey. The resulting constraints on various cosmological 
models continue to support a flat Universe dominated by 
a cosmological constant. 

Measurements of the three-dimensional correlation 
function using the first year of BOSS Lya spectra are 
found in (2011). This analysis also reports 
the first detection of flux correlations across widely sep- 
arated sightlines, a measurement of redshift-distortion 
due to peculiar velocities, and constraints on the linear 
bias parameter of optical depth. The complete five year 
quasar sample should provide the means to extend the 
analysis and observe the BAO feature 
at z > 2 for the first time. 

As described in (2011), we have 
used a Fisher matrix formalism to predict the BAO 
constraints from the five year BOSS galaxy and 
ne forest epe E SIE on 
McDonald & Eisenstei 9001). and to predict the impact 
of these constraints on cosmological parameter determi- 
nations. For the galaxy survey we forecast lo precision 
of 1.0% on Da(z) and 1.8% on H(z) at z = 0.35 (bin 
width 0.2 < z < 0.5) and corresponding errors of 1.0% 
and 1.7% at z = 0.6 (bin width 0.5 < z < 0.7). Errors 
at the two redshifts are essentially uncorrelated, while 
the errors on D(z) and H(z) at a given redshift are 
correlated at the 40% level. Dividing into smaller bins 
would yield a finer grained view of w(z) evolution, with 
larger errors per bin but the same aggregate precision. 
For the Lya forest we update the forecast slightly from 
(2011), predicting a measurement pre- 
cision of 1.996 at a redshift approximately z — 2.5 on an 
overall dilation factor scaling both DA4(z) and H~!(z). 
This result was confirmed with a full simulation of the 
survey using mock quasar spectra (Le Goff et al] 2011]. 
Da(z) and H(z) have a higher degree of degeneracy than 
in the galaxy sample, with a correlation coefficient of 0.6; 
we therefore do not report projections for individual con- 
straints here. In detail, Lya forest projections depend on 
the level of clustering bias and redshift-space distortion 
in the forest, which are only now being measured pre- 
cisely in ongoing analyses of the DR9 quasar sample. 

The impact of these measurements on cosmological 
parameter determinations depends on the assumptions 
about external constraints from other data sets and 
about the underlying space of theoretical models. We 


have computed forecasts (Appendix A of[Eisenstein et al] 
2011) within the framework adopted by the DETF 
(Albrecht et al][200G), which assumes an inflationary 
cold dark matter cosmology, allows departures from spa- 
tial flatness, and employs a dark energy equation of 
state described by w(a) = wp + Wa(ap — a), where 
a = (14 z) ! and wp = w(zp) is the value of w 
at the “pivot” redshift zp ~ 0.5 where it is best con- 
strained by BOSS. When combining the forecast BOSS 
BAO constraints with Planck priors for CMB anisotropy 
measurements and “Stage II" priors that approximately 
describe the constraints from existing dark energy ex- 
periments, we project lo errors of 0.030 on wy, 0.320 
on Wa, 2 x 107? on the curvature parameter Q;, and 
0.9% on the Hubble constant Ho. Combining BOSS 
data with measurements from other, upcoming “Stage 
III" experiments, such as supernova, weak lensing, and 
cluster abundance data from the Dark Energy Survey, 
will further tighten these constraints, breaking parame- 
ter degeneracies and allowing cross-checks of results from 
different methods. Within BOSS itself, analyses that 
exploit the full shape and redshift-space anisotropy of 
the galaxy correlation function or power spectrum (e.g.; 
“or that use other 
statistical approaches sensitive to higher order correla- 
tions, can achieve tighter constraints on dark energy and 
curvature, can test whether the rate of cosmic structure 
growth is consistent with the predictions of General Rel- 
ativity, and provide sensitivity to neutrino masses, infla- 
tion physics, extra radiation backgrounds, and other de- 
partures from the simplest forms of the ACDM scenario. 
'These approaches make stronger demands on theoretical 
models of non-linear gravitational evolution and galaxy 
bias than does BAO, but the BOSS clustering measure- 
ments allow detailed, high-precision tests of these mod- 
els. 

'The success of BOSS, as documented in this paper, 
should be viewed as strong encouragement to future ef- 
forts in constraining BAO over large volumes and large 
redshifts. BOSS data will have broad impact on our un- 
derstanding of large scale structure, massive galaxy evo- 
lution, quasar evolution, and the intergalactic medium. 
The ultimate impact on fundamental cosmology depends 
in part on what Nature has in store. Any clearly detected 
deviations from a flat universe with a cosmological con- 
stant would have profound implications, taking us closer 
to understanding one of the most remarkable scientific 
discoveries of our time. 
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APPENDIX 
APPENDIX A: FLAGS FOR PRIMARY TARGET CLASSES 


The algorithms for targeting various classes of galaxy and quasar targets described in $2] and several addi- 
tional classes of object for calibration are all recorded in the BOSS_TARGET1 flag bit. A brief description 
of those objects is found below. The density and survey area containing each type of object is found in Table [8] 
The BOSS_TARGET2 keyword serves as a placeholder for additional targets, but has not yet been used in SDSS-III. 


Galaxy Target Selection: As described in the galaxy sample used to derive new BAO constraints was divided 
into two principal samples based on redshift. The LOWZ sample (represented by the flag GAL LOZ) targets the 
redshift interval 0.15 < z < 0.43 while the CMASS sample targets the redshift interval 0.43 < z < 0.7. The CMASS 
sample is divided into four subsamples of galaxies, with significant overlap between several subsets: 


e GAL. CMASS is the core of the high redshift BOSS galaxy sample, with selection criteria described in the 
main text. The flag was assigned to a test sample in the BOSS commissioning data found in bossl and boss2 
and should not be used in those chunks. It was assigned to the full sample of CMASS objects starting in boss3. 


e GAL. CMASS.COMM represent galaxies that were targeted using a commissioning selection algorithm to 
investigate possible targeting strategies. Objects were chosen so that d, > 0.55, i < 20.14 + 1.6(d, — 0.8), 
17.5 < i < 20.0 and r — i < 2 where r and i are cmodel magnitudes. Additionally, objects were required to have 
ifib2 < 22 and ipsp — tmodet > 0.2 + 0.2(20.0 — imoder). This sample is complete only for the first two BOSS 
commissioning chunks, boss1 and boss2. The flag was never phased out and appears in DR9 from objects in later 
stages in the tiling, representing a subset of galaxies that would otherwise satisfy the GAL. CMASS.COMM 
criteria. It should not be used for boss3 or any chunks thereafter. 


e GAL CMASS.SPARSE targets have a color-magnitude cut that is shifted relative to GAL. CMASS to 
i < 20.14 + 1.6 x (dj — 0.8) to allow for completeness studies. Galaxies are either flagged GAL. CMASS or 
GAL.CMASS.SPARSE, but not both. This flag was introduced in boss3, after the commissioning period for 
galaxy target selection. 


e GAL. CMASS.ALL is used internally in the target selection code, representing the union of the GAL. CMASS 
and GAL_CMASS_SPARSE targets for boss3 and thereafter. Because the sample of galaxies identified by 
this flag can be reconstructed by the union of GAL.CMASS and GAL.CMASS.SPARSE, it provides no 
additional information and should rarely be needed by the user. 


Quasar Target Selection: The use of quasar target flags evolved significantly throughout the first two years of 
the survey as different algorithms were tested to maximize the targeting efficiency. All programs with the QSO. prefix 
in Table[8] were targeted as potential quasars. A detailed explanation of each program is given M rcr (0120). 
we simply report the densities and survey area for each program. A few target classes bear special mention: 


e The QSO_UKIDSS and QSO. KDE. COADD target classes were only used in the boss1 chunk of commis- 
sioning data on Stripe 82, covered by a small number of plates between 0? < a « 40°. 


e The QSO_CORE_ED and QSO_CORE_LIKE classes were implemented in the second year. 


e The QSO. CORE and QSO. BONUS classes were deprecated after the boss2 chunk and replaced by the 
QSO. CORE. MAIN and QSO. BONUS. MAIN classes for the rest of the survey, beginning with chunk3. 


Calibration Programs: There are five target classes dedicated to calibration programs within BOSS. The most 
prevalent target flag is STD. FSTAR, used to denote objects chosen as likely F stars as described in d3.2] These 
objects are used both for flux calibration and correction for telluric absorption. The other four target classes were 
only used in the Stripe 82 commissioning plates in the boss1 chunk. These contain objects to aid in the development 
of templates for classification and redshift determination (Bolton et al] (20120). In order for the classification to be 
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as robust and informative as possible, the set of template spectra were selected to cover a wide range in the space 
of intrinsic spectral variation. Several classes contain objects previously observed in SDSS; the extended wavelength 
coverage of BOSS requires repeat spectra of these objects to cover 361 nm — 1014 nm in the observer frame. The 
targets are divided into four categories: 


e TEMPLATE. GAL.PHOTO were galaxies selected to uniformly cover the space of [(g —r), (r— i), i]. Objects 
were first chosen with dereddened cmodelmag i < 20, gri», Tfib, and ipfi > 16.5, SNR > 5.0 for gri model 
magnitudes to limit scatter in color measurements, and an absolute value of the difference between the model 
magnitude and cmodel magnitude in 7 less than 0.8. Next, for each galaxy, the total number of galaxies within 
a radius of 0.1 magnitudes in [(g — r), (r — i), i] space was computed and taken as an estimate of the local 
density in that space. A subsample was then drawn at random, with the probability of retaining any given 
draw proportional to the inverse of the local density. T'his generates an approximately uniform sampling of the 
populated region within the color-color-magnitude space, where colors g — r and r — i were defined using model 
magnitudes and 7 was determined using cmodel magnitudes. 


TEMPLATE.QSO.SDSSI were selected from the DR7 catalog of quasars (Schneider et_al.[2007)). 'The sample 
is restricted to objects with dereddened PSF magnitudes in ugriz « 20.0 and gri fiber magnitudes > 16.5. A 
random subsample was chosen after imposing a uniform distribution in redshift out to z « 3.2 and taking all 
quasars at redshifts z > 3.2. The PCA redshift templates in BOSS are based exclusively on this sample. 


TEMPLATE_STAR_PHOTO were selected in similar fashion to the galaxy sample above, with dereddened 
PSF magnitudes brighter than 19.25 in any one of ugriz, SNR > 5.0 in all filters using PSF magnitudes, and 
fiber magnitudes fainter than 16.5 in each of gri. A local density estimator within the PSF magnitude space of 
[((u— g), (g —r), (r— i), (6 — z)] was computed from the number of stars within a 4-sphere of radius 0.1 magnitude 
centered on each star. A random subsample of stars was drawn with the probability of retaining any given draw 
proportional to the local density to the power -0.7 (compared to -1 for the galaxy sample). This heuristic factor 
is chosen to retain some preferential weighting in favor of the core stellar locus, while still achieving significant 
representation from the less densely populated regions of color space. 


TEMPLATE STAR.SPECTRO were selected from the SDSS (non-SEGUE) spectroscopic database for 
re-observation to provide a sample of spectra that cover the major classes of stellar objects that lie away from 
the main stellar locus. The objects were required to have been both targeted as stars photometrically and 
classified as stars spectroscopically. The objects are required to have dereddened PSF magnitudes brighter 
than 19.25 in ugriz and fiber magnitudes fainter than 16.5 in each of gri. Stars were then selected at random 
to represent each of the following SDSS target categories identified by the DR9 LEGACY TARGETAYE 
and LEGACY_TARGET2® flags: STAR. BHB, STAR. CARBON, STAR- -BROWN DWARF, 
STAR_SUB_DWARF, STAR.CATY VAR, STAR RED DWARF, STAR WHITE DWARF, 
STAR. PN, REDDEN.STD, SPECTROPHOTO.STD, and HOT. STD. 


#1 http://www.sdss3.org/dr9/algorithms/bitmask_legacy_targetl.php ^ s http://www.sdss3.org/dr9/algorithms/bitmask_legacy_target2.php 
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TABLE 8 
BOSS PROGRAMS WITH BOSS_TARGET1 FLAG 
Program Bit Density Survey Area 
Number (deg~?) (deg?) 

GAL_LOZ 0 20-30 10,000 
GAL_CMASS 1 110 9,750 
GAL_CMASS_COMM 2 115 500 
GAL.CMASS.SPARSE 3 5 9,750 
GAL_CMASS_ALL 7 115 9,750 
QSO_CORE 10 14 ~ 250 
QSO_BONUS 11 17 ~ 250 
QSO.KNOWN.MIDZ 12 1.5 10,000 
QSO.KNOWN.LOHIZ 13 0.25 ~ 100 
QSO.NN 14 20 10,000 
QSO.UKIDSS 15 1.0 ~ 50 
QSO.KDE.COADD 16 27 ~ 50 
QSO.LIKE 17 28 10,000 
QSO.FIRST.BOSS 18 1.0 10,000 
QSO.KDE 19 28 10,000 
STD.FSTAR 20 4.6 10,000 
TEMPLATE_GAL-PHOTO 32 4.2 ~ 50 
TEMPLATE_QSO_SDSS 33 2.6 ~ 50 
'TEMPLATE.STAR.PHOTO 34 3.1 ~ 50 
TEMPLATE_STAR_SPECTRO 35 1.2 ~ 50 
QSO.CORE.MAIN 40 20 10,000 
QSO.BONUS.MAIN 41 19 10,000 
QSO.CORE.ED 42 10 9,000 
QSO.CORE.LIKE 43 11 9,000 
QSO.KNOWN.SUPPZ 44 0.25 ~ 100 


APPENDIX B: TARGET SELECTION AND SCIENTIFIC MOTIVATION FOR ANCILLARY SCIENCE PROGRAMS 


The first category of ancillary science programs lies in the region of Stripe 82 (the celestial equator in the SGC) 
that has completed observations and will be included in DR9. The second set includes programs that are dispersed 
throughout the full BOSS footprint and will continue to accumulate data throughout the survey. Additional ancillary 
programs may be added in the future; those programs will be described in future data release papers. All ancillary 
targets are assigned fibers at a priority that is lower than the primary galaxy and quasar targets; sample selection 
for these programs is therefore not complete. Here we provide a comprehensive description of the motivation, target 
selection, and the target densities for these programs. For programs that are divided into subprograms, we also provide 
a description of the ancillary target flags and selection for each subprogram. The target bits and statistics for the 
ancillary science targets are reported in Table[9]and Table[I0]at the end of this appendix. The target bit names appear 
in bold font in what follows. 


Stripe 82 


Several of these BOSS ancillary programs are contained completely in the Stripe 82 region. We finished all plates 
in Stripe 82 in 2011, leading to a group of ancillary programs that are included in its entirety in DR9. Starting with 
the nearest objects, we explain the rationale and target selection behind each of these programs below. 


of Stripe 82 allows identification of tran- 


The Transient Universe Through Stripe 82: The repeat imaging 
sient and variable phenomena of all sorts (for example 2008 
. In this program, several classes of variable 
point sources and high-proper motion stars discovered in Stripe 82 photometry were targeted for spectroscopy. These 


objects include flaring M stars (Kowalski et al. , faint high proper motion stars (Scholz et al||2009; Schmidt et al. 
2010¢), candidate low-metallicity M dwarfs, and diverse samples of variable stars (Roelofs et al. 


2008; Blake et al. [2008). BOSS spectra were obtained to characterize the statistical properties of these categories of 
transient objects. Approximately 600 objects were targeted across Stripe 82. The classes of transient objects were 
selected as follows: 


e FLAREI, FLARE2 consists of ~ 200 flaring M stars selected from the [Bramich et al] and [Ivezić et al] 
(2007) catalogs with gpsr < 21.4, ipgp < 19, 0.3 < (ipgp — zpsr) < 1.3, and exhibiting a flare event in the Stripe 
82 imaging data with amplitude Au > 1 mag Mec Ad 

e HPM consists of ~ 100 high proper motion stars also selected from the catalogs of and 
with an emphasis on faint objects with high proper motions (u > 0.1 mas yr~!). The goal of 
this target selection was to identify nearby low-mass stars and white dwarfs. Candidate nearby low-mass stars 
include faint stars (19 < zpsr < 20) with (ipse — zpsr) > 1.5, including objects with photometric detections in 
z band only. Candidate nearby white dwarfs include stars with (gesg — rpesr) ~ 0 and gpsr > 19. 
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e VARS consists of ~ 200 variables with gpsr < 21.5, (gesr — reser) > —0.5, and RMS variability in g band 
> 0.1 mag. These objects were selected to lie outside the stellar and quasar loci in (gesr — rpsr) — (resr — ipsr) 


color-color space as defined in [Fan et al] (1999). 


e AMC consists of ten candidate Am CVn stars selected based on their variability and colors ((upsr — gesr) > 0.4 
and (gesr — resr) > 0). 


As part of this ancillary program, a number of spectral templates were observed as well. These include a random 
sample of approximately 100 M stars, designated MTEMP, selected to span the spectra range MO to M8 following 
the color criteria outlined in [West et al] (17.5 < ipsp < 18.5, (rpsp — ipsp) > 0.5, (ipsp — zpsr) > 0.3). 
Approximately 70 candidate low-metallicity M stars (flag LOW MET) were also targeted; their colors were selected 
to be slightly outside the low-mass star stellar locus defined in fee (2005). The purpose of these sub-samples 
is to enable direct comparisons between the prevalence of line emission in the photometrically-selected flare stars and 
field M stars selected only on the basis of their colors. 


Host Galaxies of SDSS-II SNe: While many SNe Ia identified in the SDSS-II program (Frieman et_al.||2008) 


were spectroscopically confirmed and used in the first year cosmology results ( , most SNe and their 
host galaxies have not been observed spectroscopically. With redshifts of the host galaxies, SN lightcurves can be fit 
with fewer degrees of freedom, leading to identifications of SN type with higher confidence. The new classifications 
will allow a much larger number of SNe Ia to be placed on the Hubble diagram (Campbell et al] 2012). In this 
ancilary program, candidate SN hosts were drawn from a database containing 21,787 potentially variable objects 
determined from the repeat imaging of Stripe 82. The next stage of selection required coincidence 
of a signal in at least two passbands after vetoing regions of bright stars and variability from known active galactic 
nuclei (AGN) or variable stars. In total the list includes 4099 candidates of different SN types and different confidence 
levels as determined from a Bayesian classification of the lightcurve shapes. Fibers were assigned to 
3743 of these host galaxies to obtain a redshift and thereby improve the SN classification. Approximately one third 
of the targets have lightcurves that do not resemble SNe and are included as a control sample. The redshifts and 
new classification will lead to a nearly complete sample of SNe Ia to z « 0.4 and an enhanced cosmological analysis. 
'The new classification will also enable a large statistical study of the correlated properties between SNe Ia and their 
host galaxies (e.g., [2010). Subsamples of 
SNe from this ancillary program have already demonstrated correlations of residuals from the Hubble Diagram with 
spectroscopically-derived host properties (D'Andrea et al] [2011]. 'The candidates were selected after visual inspection 
and divided according to the following ancillary program flags: SN. GALLI: fibers assigned to the position of the 
core of the nearest candidate host galaxy (this accounted for 95% of the cases); SN-GAL2: fibers assigned to the 
position of the core of the second nearest candidate host galaxy (when the first was clearly a background object or 
a misclassified star; 4% of the targets); SN. GALS3: fibers assigned to the position of the core of the third nearest 
candidate host galaxy (1% of the targets); SN-LOC: already have SDSS host spectra so the fiber was assigned to 
the location of the original SN candidate; SPEC. SN: SNe identified from SDSS spectra (e.g. 
rather than photometric variability. 


Brightest Cluster Galaxies (BCGs): Over 3000 groups and clusters have been identified photometrically in 
Stripe 82 2012). The clusters were selected from ugriz photometry generated from 
the coaddition of Stripe 82 images (Annis et al. that is ~ 2 mag deeper than the rest of the area in the BOSS 
footprint. These clusters have photometric redshifts in the range 0 < z < 0.6 (median z = 0.32) and are expected to 
reside in dark matter halos with masses in excess of 2.5 x 101? M. Each cluster is assigned a “BCG”, which is simply 
defined as the brightest member associated with the cluster detection. To confirm the cluster redshifts, we obtained 
spectra of the likely BCGs with magnitudes 17 < igy» < 21.7 and colors that vary with redshift according to the cluster 
detection algorithm (Murphy et aliod. 1505 galaxies were requested and 1345 were observed that were not included 
in the main SDSS-I or -II galaxy, SDSS LRG, BOSS LOWZ, BOSS CMASS, 2SLAQ, or WiggleZ samples. This new 
sample of spectroscopically-confirmed clusters will enable a wide variety of science including the weak lensing of groups 
and clusters, the link between star formation and AGN activity in BCGs, and the LRG population of dark matter halos. 


High Quality Galaxy Spectra: Two plates (3615 and 3647) are being observed repeatedly throughout the survey 
several times per year to test reproducibility of extracted spectral parameters and to create exceptionally high SNR 
spectra of a subset of objects in the main BOSS sample. These observations will produce accumulated exposure times 
of more than 20 hours on all objects on the plates, including 96 LOWZ and 503 CMASS galaxies at a = 37° on Stripe 
82. These plates contain identical spectroscopic targets but are drilled for different airmass. The targets acquired 
16.5 hours of total exposure between December 1, 2009 and July 15, 2011 and will be observed in each of the three 
remaining years of the survey. The final observations will enable detailed spectral population studies of intermediate 
redshift galaxies. The data also provide a platform for tests of the consistency in spectroscopic parameter extraction 


(e.g., 'Thomas et al., in Ll and of systematic errors in the models of continuum fitting in the Lya 
forest region of the quasar sample (e.g., 2012b). 
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No Quasar Left Behind: We observed unresolved sources that had not been previously observed in SDSS 
spectroscopy that exhibit photometric variability statistically similar to that of spectroscopically confirmed quasars. 
Approximately 1500 targets were selected from 11,000 variable sources with 16.2 < ipsp < 20.5 identified according 
to the technique outlined in (2011). This study complements the sample of “Variability Selected 
Quasars" described below but targets brighter objects without color cuts, leading to a higher density of lower redshift 
quasars. The adopted variability-based selection criteria correctly identify 9696 of previously known quasars, and more 
than 80% of targets are expected to be quasars in the redshift range 0 < z < 5. The brightest of these (ipgr < 19) 
were observed to test the completeness of the color-selected SDSS sample (Boss et a] O12. 'The fainter subset 
represents a nearly complete sample that was selected from uniform photometry. In addition, the program provides 
an invaluable training sample for optimizing quasar selection algorithms based on photometric variability, vital for 
future synoptic surveys. 


Variability Selected Quasars: Variability is used to improve the selection efficiency for quasars around z — 2.7 
and ~ 3.5, where they lie in the region of color-color space that is occupied by the stellar locus. The selection method 
is quite similar to that used for the variability target selection used to select the main sample of quasars in Stripe 82 
as described at the end of 'The sample also complements the sample of *No Quasar Left Behind" but includes 
fainter quasars and imposes a loose color selection to select primarily objects at z > 2.15. In this method, objects 
were included with igp2 > 18, (gesr — ipsr) < 2.2, (upsr — gesr) > 0.4, and cı < 1.5 or c3 < 0. Colors cı and c3 are 


( 
defined in [Fan et al] (1999) as 


cı —0.95(u — g) + 0.31(g — r) + 0.11(r — i), 
c3 = —0.39(u — g) + 0.79(g — r) + 0.47(r — i). 


A variability neural network is used to quantify the likelihood that each object is a quasar. The neural network 
takes as an input (1) the x?s between the light curve in each band and a model assuming no variability and (2) a 
structure function derived from the lightcurve as in [Schmidt et al] (20104). Targets require a probability of being a 
quasar from the variability neural network greater than 0.95. Candidate quasars are chosen from ~60 imaging epochs 
from the last ten years, resulting in about 15 deg? quasar candidates. After removal of previously known quasars 
and removal of candidates that were already included in the main quasar selection, 3.4 deg? were selected for this 
sample. These data address the completeness of color selections and identifies obscured objects that would not be 
selected otherwise (Palanque-Delabrouille et al][2011]). These data are also being used to demonstrate the efficiency 
of variability target selection for possible implementation in future surveys such as BigBOSS (Schlegel et al][2011], a 
ground-based dark energy experiment to study BAO and the growth of structure with deeper observations than BOSS. 


Reddened Quasars: Quasar candidates likely to be intrinsically reddened with E(B-V) > 0.5 were selected from 
SDSS photometry and either FIRST or the Two Micron All Sky Survey (2MASS; [2008). The goal 
was a better determination of the distribution of reddenings in intervening absorbers and at the quasar redshift (e.g., 
[Ménard et al] [2008), and investigating reddening as a function of quasar properties. Targets 
were selected based on color selection and stellar morphology. All targets were required to have 17 « zpsr « 21 
with Galactic extinction colors satisfying (gpsr — ipPsr) — 30(gpsp—ipsr) > 0.9, where o is the measurement error 
in the color. Targets selected from SDSS plus FIRST were denoted RQSS.SF or RQSS.SFC (the trailing C 
indicates a CHILD object in SDSS, as in and were additionally required to match within 
2" of a FIRST source. Targets selected from SDSS plus 2MASS (RQSS_TM or RQSS_TMC) were additionally 
required to match within 3" of a 2MASS source having 11 < K < 15.1, (J — K) > 1.25, and a 7-dimensional color 
distance greater than 50 (this is the distance in units of x? to the point on the stellar locus 
with the same (g — i) color as the object). The program was discontinued after the first year due to low yield, and 
targets are included on only 13 completed plates. It is superceded by the program “K-band Limited Sample of Quasars”. 


K-band Limited Sample of Quasars: Potential quasars are identified via photometering UKIDSS 
at the position of SDSS sources to a limiting magnitude Kag < 19.0 (Kyega < 17.1). The 
intention was to identify highly red and reddened quasars not included in the main selection methods. The sample 
allows an investigation, for instance, of the incidence of BAL quasar samples, or of highly reddened gravitation- 
ally lensed quasars. In the original selection, approximately 2000 targets were identified that have (gesr — ipsr) « 
1.153 x (ipsr — Kap) — 1.401 to avoid the color locus of the stellar main sequence, and (upsr — gesr) > 0.4 to exclude 
UV excess quasars (i.e., unreddened quasars at redshifts of about z « 2.15). Here, the K-band magnitudes are on the 
Vega system, while the SDSS photometry is AB. The targets not included in Stripe 82 were effectively incorporated 
into the BONUS sample through the XDQSO selection and this ancillary program was not 
continued for the remainder of the BOSS footprint. This ancillary program superseded the ancillary program “A 
Large Sample of Reddened Quasars”, as the superior UKIDSS photometry could better target red, reddened and high 
redshift quasars than 2MASS. In addition, the cut of (upsr — gpsr) > 0.4 better complemented the BOSS mission to 
target mostly higher redshift quasars. 
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Full BOSS Survey 


The remaining BOSS ancillary science programs are distributed over some or all of the full 10,000 deg? footprint 
and are selected from single-epoch imaging data. Observations will continue through the end of the survey, and 
additional programs will be added in the future. As with the Stripe 82 programs, we list these programs in order of 
increasing distance. 


Very Low Mass Stars and Brown Dwarfs: Very low mass stars and brown dwarfs (spectral types M8, M9 
and L) are ideal tracers of the kinematic properties of the Milky Way thin disk. While the program “The Transient 
Universe Through Stripe 82" uses variability information over Stripe 82 to identify rare classes of stars, this program 
uses photometric selection over the much larger BOSS survey area to identify new low mass stars and brown dwarfs. 

SDSS I and II yielded a wealth of spectroscopic data of these ultracool dwarfs 
[2011], but were limited by the small number of spectra observed for stars with spectral types later than M7. Additional 
observations of these objects are essential to understand the properties of magnetic activity in these ultracool dwarfs 
(extending the results offWest et aL [2008]. The data will also enable us to use kinematics to understand the distribution 
of ages, especially at the stellar/sub-stellar boundary. Finally, the sample will contain a class of L dwarfs which are 
peculiarly blue in the near-infrared but have typical L dwarf colors in SDSS i — z (Schmidt et_alJ[2010b). 

The program was divided into different densities in Stripe82 (5 deg?) and the rest of the BOSS footprint (1 deg ?). 
In order to select the cleanest possible sample, we included 2MASS magnitudes as part of our selection criteria. Our 
Stripe 82 criteria were: 


e ipsr — Zpgr > 1.14 
e ipgp < 21 
e ipsp — J > 3.7 
e 1.9 < zpsF >- J «4 
The criteria for the rest of the BOSS footprint are: 
e ipsr — Zpgr > 1.44 
e ipgp < 20.5 
e ipsp — J > 3.7 
el19-«zpsp—J <4 


Here, the J band photometry, unlike the SDSS photometry, is Vega-based. 

'The targets are divided into subsamples with different priorities. L dwarfs are both less common and fainter, so our 
priority was to detect L dwarfs over late-M dwarfs. Targets likely to be bright L dwarfs (BRIGHTERL; ipgp < 19.5, 
TPSF — ZPSF > 1.14) are assigned first priority and fainter L dwarfs (FAINTERL; ipsr > 19.5, ipsr — zPsrF > 1.14) 
are assigned second priority. The bright M dwarfs (BRIGHTERM;; ipsp < 19.5, ipgp — zpsr < 1.14) are assigned 
third priority and FAINTERM dwarfs are assigned lowest priority. 


Low-Mass Binary Stars: Ultra-wide, low-mass binaries probe how dynamical interactions affect and shape 
the star formation process and the environment in which the processes occur. The binaries are also ideal coeval 
laboratories to constrain and calibrate properties of low-mass stars, as they were presumably born at the same time 
of the same material. The Sloan Low-mass Wide Pairs of Kinematically Equivalent Stars (SLoWPoKES; [Dhital et al] 
catalog is the largest sample of wide, low-mass binaries. The objects in the catalog were identified from 
common proper motions and photometric distances but lacked information regarding radial velocities. The BOSS 
spectra will be used to measure radial velocities for the components and confirm their physical association. Targets 
for this program were selected in two ways: (i) SLoWPoKES systems of spectral type MO or later which had robust 
SDSS/USNO-B proper motions and (ii) systems of spectral type M4 or later, without proper 
motions. Both sets of targets were selected to have angular separations of 65-180" to avoid fiber collision and to be 
brighter than gy» = 20 to achieve the critical SNR. The pairs from the second sub-sample are valuable for probing the 
lower-mass mid- and late-type M stars, which were underrepresented in SLoWPoKES. Although the lack of proper 
motion matching for these pairs makes them more susceptible to chance alignments, radial velocities from BOSS 
spectroscopy will confirm their common motions. From the combined sample, 500 pairs were randomly selected for 
targeting with BOSS, and both components will be observed. The spectra will be used to measure and calibrate the 
metallicity and mass-age-activity relationship for low-mass stars (e.g., [Dhital et a1][2012]), especially at the mid-late 
M spectral types. There was an error in correcting the positions of the target for their proper motions in the first 
year, affecting targets in plates numbered less than 3879 and between 3965-3987; few of these targets resulted in 
usable spectra. 

White Dwarfs and Hot Subdwarf Stars: The SDSS multi-color imaging efficiently distinguishes hot white dwarf 
and subdwarf stars from the bulk of the stellar and quasar loci (Harris et #1) 2003). Special target classes in SDSS 
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produced the largest spectroscopic samples of white dwarfs [Eisenstein et al][2006). However, 
much of SDSS white dwarf targeting required that the objects be unblended, which caused many brighter white dwarfs 
to be skipped (for a detailed discussion, see 8 5.6 of Eisenstein et all[2000). 'This BOSS ancillary program relaxes this 
requirement and imposes color cuts to focus on warm and hot white dwarfs. Importantly, the BOSS spectral range 
extends further into the UV, allowing full coverage of the Balmer lines. We require targets to be point sources with 
good u, g, and r photometry (following the clean point source selection from the DR7 documentation) and USNO 
counterparts. We restrict to regions inside the DR7 footprint with Galactic extinction of A, « 0.5 mag. Targets 
must satisfy g < 19.2, (u — r) < 0.4, —1 < (u — g) < 0.3, and —1 < (g — r) < 0.5, using extinction-corrected model 
magnitudes. Additionally, targets that do not have u —r < —0.1 and g — r < —0.1 must have USNO proper motions of 
more than 2" /century. Objects satisfying the selection criteria and not observed in previously in SDSS are denoted by 
the WHITEDWARF. NEW target flag, while those with SDSS spectra are assigned the WHITEDWARF_SDSS 
flag. Some of the latter objects are re-observed with BOSS in order to obtain the extended wavelength coverage. This 
color selection includes DA stars with temperatures above ~14,000 K and helium atmosphere white dwarfs above 
~8000 K, as well as many rarer classes of white dwarfs. Hot subdwarfs (sdB and sdO) will be included as well. Many of 
these stars are excellent spectrophotometric standards and can be tested in comparison to the BOSS F-star calibration. 


Distant Halo Giant Stars: Rare giants in the outer halo of the Milky Way are selected using a targeting strategy 
tested in SEGUE2 (the SDSS-III counterpart to SEGUE; Rockosi et al., in prep). Observations 
of the new BOSS targets are expected to increase the number of known halo stars beyond 60 kpc by a factor of 
ten. The full sample RED. KG will enable measurements of substructure of the Milky Way in position-velocity 
phase-space, a signature of the hierarchical assembly of the stellar halo. The data will thus provide tests of ACDM 
predictions for galaxy assembly in a part of the Galaxy expected to be almost entirely composed of debris from 
recent accretions (Xue et al] POTD). The sample will also enable dynamical mass estimates for the Milky Way out 
to 150 kpc. Objects with 17 < gay» < 19.5 are selected from a color box, with (upgr — gpsr, gpsr — rpsr) corners 
of (0.8,2.35), (0.8,2.65), (1.4,3.0), and (1.4,3.9). This selection identifies stars near the tip of the red giant branch 
in the region of the u — g/g — r diagram where giants separate from the locus of foreground dwarfs 
[2009]. In addition, total proper motion must be « 11 mas/year, zero within three standard deviations of the 
proper motion errors, to reject nearby stars. The 3" fiber magnitude must be i > 16.5 as an additional mechanism 
to reject bright targets. A smaller sample of targets is found on the commissioning plates. Denoted RVTEST, 
this sample targets stars previously observed spectroscopically as a test of the reproducibility of velocity measurements. 


Bright Galaxies: Bright galaxies were commonly missed in the original SDSS spectroscopic survey due to fiber 
collisions, bright limits (objects with model magnitudes r > 15, g > 15, or i > 14.5 were excluded), and errors in 
the deblending of overlapping images (Strauss et al]|2002). Approximately 10% of the brightest galaxies were not 
spectroscopically observed (Fukugita et al. . To improve the completeness of this spectroscopic sample, objects 
were chosen with Petrosian radius > 1" in r to reject stars, with no saturated pixels, and with extinction-corrected, 
Petrosian (Petrosian[1976; r magnitude between 10 and 16. Targets were also required to have an 
extinction-corrected Petrosian magnitude 7 « 20 and z < 20 to exclude misidentified satellite tracks which would not 
show up in the other bands. Galaxies without spectra (724,000 from the original list of 93,000) where then visually 
vetted to remove foreground stars that remained in the sample, detector artifacts (e.g. internal reflections) that were 
misidentified, and other sources of confusion. In cases where a foreground star was misidentified as the galaxy center, 
the target position was moved to the correct position. In cases of merging galaxies, we visually identified multiple 
targets corresponding to the centers of each galaxy. The list was cross-correlated with the Third Reference Catalog 
of Bright Galaxies (RC3 [Corwin et al][1994], and any targets that did not appear in the 
original SDSS spectroscopic survey were added to the target list (0.0596 of the final list). Finally, targets within 2" of 
a star that appears in the Tycho-2 Catalog were removed. The final sample includes 8637 galaxies 
over the BOSS footprint. 


High Energy Blazars and Optical Counterparts of Gamma-ray Sources: We targeted candidate optical 
counterparts of sources detected (or likely to be detected) by NASA's Fermi Gamma-ray Space Telescope (Atwood et al] 


[2009], with the goal to spectroscopically confirm and provide redshifts for candidate gamma-ray blazars, with model 
magnitude m « 21 in any of the three bandpasses g, r, or 7. We also require targets to have 3" fiber magnitudes 
m > 16.5 to minimize impact of fiber cross-talk. Ranked in approximate order of priority, fibers are assigned to targets 
from the following subprograms: 


e BLAZGXR: about 300 blazar candidates are assigned at highest priority to DR7 optical sources within Fermi 


gamma-ray error ellipses. Targets must also lie within the < 1’ radius error circle for X-ray sources in the ROSAT 
All-Sky Survey (RASS) (Voges et al] and within 2" of a FIRST (Becker et al][1995) radio source. 


e BLAZGRFLAT: about 175 blazar candidates detected with Fermi and the Combined Radio All-Sky Targeted 
Eight GHz Survey (CRATES; . Objects from the DR7 catalog within 2" of a CRATES radio 
source and within a Fermi error ellipse were targeted. 


e BLAZGXQSO: 95 further candidate X-ray and gamma-ray emitting quasars/blazars, including photometric 
quasar/blazar candidates (Richards et al. , as well as confirmed DR7 quasars/blazars (Schneider et al]2010) 
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revisited to assess optical spectral variability. Targets are selected that lie within < 1’ of a RASS X-ray source 
and within Fermi error ellipses. 


e BLAZGRQSO: 185 candidate radio and gamma-ray emitting quasars/blazars, including both photometric can- 
didates (Richards et al][2009), and DR7 confirmations (Schneider et al][2010) revisited to assess optical spectral 
variability. Targets are selected that lie within 2" of a FIRST radio source and within Fermi error ellipses. 

e BLAZGKX: 75 targets that are candidate high-energy counterparts but which lack typical (e.g., radio emission, 
unusual optical color, etc.) blazar properties were targeted to probe unknown classes of gamma ray sources. The 


optically brightest objects from DR7 within the Fermi error ellipses and within 1' of a RASS X-ray source were 
preferentially targeted. 


e BLAZXR: 1100 targets are selected that may plausibly emerge as Fermi sources, but are still below the detection 
limits in the early Fermi source catalogs. The approach is similar to the *ROSAT.A" target selection scheme 
described in and the “pre-selection” approach of (2008) that provided many 
of the gamma-ray counterpart associations reported in the first Fermi catalogs (Abdo et al. ROTON). Targets 
are chosen from the DR7 photometry catalog with radio coincidence (within 2” of a FIRST source) and X-ray 
coincidence (< 1' of a RASS source). This sample overlaps heavily with the BONUS quasar sample, but includes 
quasars at lower redshift. 


In addition, there were ten miscellaneous candidate blazar spectra taken in an early trial of this program. These 
targets were assigned subcategory names using the following flags: BLAZGVAR, BLAZR, and BLAZXRSAM. 


An X-ray View of Star-Formation and Accretion in Normal Galaxies: The extended wavelength coverage 
and improved throughput of BOSS relative to SDSS enable studies of the relationship between star formation and 
black hole accretion in galaxies using the key diagnostic Ha/N[H] to z ~ 0.49. For this study, a target list was derived 
from the matched Chandra Source Catalog (CSC; version 1; and SDSS DR7 photometric catalogs. 
High quality matches underwent visual inspection in X-ray and optical imaging and were required to have positional 
matches between the SDSS DR7 catalog and the CSC as defined by the matching method that 
incorporates positions, positional errors, and sky coverage. Matches with Bayesian probability « 0.5 suffer from a 
larger number of multiple matches and are discarded. In addition, targets were required to have model magnitude 
16.5 « r « 20.75 and Chandra off-axis angles 0 « 10’. Objects with existing SDSS spectroscopy, proper motions from 
exceeding 11 mas/year (selection criteria are described further in Hazeard et all 2010), or 
poor-quality X-ray measurements are removed from the sample. Because only sources with Chandra coverage were 
included, the target have a non-uniform distribution over the DR7 footprint. To avoid overlap between the targets 
selected for this program and the program “Remarkable X-ray Source Populations” described below, the target lists 
were cross-correlated (using a 2" match radius) and 754 duplicates were removed from this program designation. 


Remarkable X-ray Source Populations: This program targets remarkable serendipitous X-ray sources from the 
Second XMM-Newton Serendipitous Source Catalog (2XMMi; and the Chandra Source Catalog 
(CSC; that do not already have available identification spectra. Source types of primary interest 
include AGN (often obscured), X-ray binaries, magnetic cataclysmic variables, and strongly flaring stars. 

XMM-Newton sources were required to lie within 14' of the pointing center of an XMM-Newton observation, to have 
a 2XMMi detection likelihood greater than 12 as defined ui (2009), to have a statistically significant 
match to an SDSS photometric counterpart, and not to have any 2XMMi problem flags set. Similarly, Chandra 
sources were required to have an SDSS counterpart and not to have any indication of Chandra source confusion. All 
XMM-Newton and Chandra sources having Galactic latitudes of |b| < 20° were furthermore removed to emphasize 
extragalactic sources. 

The four XMM-Newton source types for this program were all selected using SDSS model magnitudes and are 
determined as follows: 


e XMMHR: 1030 XMM-Newton sources were selected that have unusual 2X MMi hardness ratios in the HR2-HR3 
plane. These sources were also required to have i « 20.5 (to ensure reasonable BOSS spectral quality) and to 
have 2-12 keV X-ray to i-band flux ratios greater than 0.03 (to minimize stellar contamination). Optical fluxes 
are defined as f, Av, where Av is the width of the bandpass. 


e XMMBRIGHT: 826 XMM-Newton sources were selected that have bright 2-12 keV fluxes (brighter than 
5 x 10-14 erg cm? s71). These sources were also required to have i < 20.0. 


e XMMRED: 627 optically red XMM-Newton sources were selected that have SDSS colors of g — i > 1.0. These 
sources were also required to have 7 < 19.3 and to have 2-12 keV X-ray to i-band flux ratios greater than 0.03. 


e XMMGRIZ: 149 XMM-Newton sources were selected that have “outlier” SDSS colors. Specifically, we selected 
SDSS point-source counterparts that have g — r > 1.2 or r — i > 1.0 or i — z > 1.4. These sources were also 
required to have i = 18-21.3 and to have 0.5-2 keV X-ray to i-band flux ratios greater than 0.1. 
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There is some overlap among these XMM-Newton source-selection approaches (e.g., a bright XMM-Newton source 
might also have optically red colors). At each selection step, we removed sources already selected in previous XMM- 
Newton selection steps (following the ordering above). There are 2632 selected XMM-Newton sources in total. 

As with the XMM-Newton sources, Chandra source types were all selected using SDSS model magnitudes and are 
determined as follows: 


e CXOBRIGHT: 387 Chandra sources were selected that have bright 2-8 keV fluxes (brighter than 5 x 
10714 erg cm? s71). These sources were also required to have i < 20.0. 


e CXORED: 635 optically red Chandra sources were selected that have SDSS colors of g — i > 1.0. These sources 
were also required to have 7 < 19.3 and to have 2-8 keV X-ray to i-band flux ratios greater than 0.03. 


e CXOGRIZ: 66 Chandra sources were selected that have “outlier” SDSS colors. Specifically, we selected SDSS 
point-source counterparts that have g — r > 1.2 or r—i> 1.0 or à — z > 1.4. These sources were also required 
to have 7 = 18-21.3 and to have 0.5-2 keV X-ray to i-band flux ratios greater than 0.1. 


Again, there is some overlap among these Chandra source-selection approaches, and again at each step we removed 
sources already selected in previous Chandra selection steps (following the ordering above). There are 1088 selected 
Chandra sources in total. We furthermore removed selected Chandra sources that were also selected XMM-Newton 
sources; this reduced the number of selected Chandra sources to 952. 

Star-Forming Radio Galaxies: Joint analysis of SDSS, FIRST, and the NRAO VLA Sky Survey (NVSS; 
1998) has shown that low redshift radio AGNs play an essential role in regulating the growth of massive 
galaxies (e.g. [2007]. However, much less is known about the detailed interplay of gas cooling and 
radio feedback in more luminous radio galaxies at higher redshifts. Current samples are incomplete, in particular for 
radio galaxies with significant on-going star formation. This ancillary program selects radio galaxies with blue colors 
at z > 0.3 that would otherwise be missed from the LOWZ and CMASS samples. Galaxy targets were selected from 
DR7 according to the following criteria: extended morphology in SDSS photometry; (2) clean ugriz model photometry 
and 17 < i < 19.9; (3) i pip < 21.7; (4) [((g — r) > 1.45] OR [(u — g) < 1.14 (g — r)] where photometry is determined 
from model magnitudes. The later criterion is designed to color-select objects at z > 0.3. We cross-matched this 
sample with the FIRST catalog (July 2008 version) and selected all objects within 3" of FIRST sources with fluxes 
> 3.5 mJy. Most targets were within 1.5". Finally, we rejected objects spectroscopically observed by SDSS-I/II, and 
objects meeting the target selection criterion for the galaxy samples. In total there were 4610 targets; we randomly 
sampled these to produce a final list of 4170 ancillary targets. 


Galaxies Near SDSS Quasar Sight-Lines: Obtaining accurate redshifts of galaxies projected near the lines 
of sight to quasars with existing SDSS spectra allows a study of the properties of galaxies that are associated 
with intervening quasar absorption systems. BOSS enables a study at small galaxy-quasar separations that could 
not be done with SDSS due to fiber collisions. Galaxy targets are selected by a ugr color cut to lie in a redshift 
range (z > 0.35) where MglI at 2800 À is detectable in SDSS spectra. The sample of spectroscopically confirmed 
quasars was selected with model magnitude g « 19.2 and redshift 0.7 < z « 2.1 from the SDSS DR7 quasar catalog 
(Schneider et allDO10). Galaxies were chosen that lie between 0.006’ and 1’ of a quasar with spectroscopy, with model 
magnitudes 17.5 < i < 19.9, Ag < 0.3, and (g — r) > 1.65 OR (u — g) < 1.14 x (g — r) — 0.4 to select objects with 
z > 0.35. The sample is weighted to have similar numbers of galaxies at separations b < 0.5' and 0.5 < b < 1’. A 
sample of approximately 3000 galaxies will help determine whether absorption is correlated with galaxy type, the 
covering fraction of absorption as a function of radius, and the size of velocity offsets. 


Luminous Blue Galaxies at 0.7 « z « 1.7: Studies from the second Deep Extragalactic Evolutionary Probe 
(DEEP2; reveal that the most luminous, star-forming blue galaxies at z ~ 1 appear to be a 
population that evolves into massive red galaxies at lower redshifts (Cooper et al]B008. Sampling 2000 color-selected 
galaxies in Stripe 82 and from the CFHT-LS Wide fields (W1, W3, and W4) allows a measure of the clustering of the 
rarest, most luminous of these blue galaxies on large scales. Such a measurement has not previously been conducted, 
as prior galaxy evolution-motivated surveys have a limited field of view and mostly target fainter galaxies. 

'The galaxy targets were color-selected based on the CFHT-LS photometric-redshift catalog (Coupon et al DOOD). 
Different color selections were explored using either the (upsr — gpsr, gpsr — rpesr) color-color diagram down to 
gesr < 22.5 or the (gesr — rpsr, rPsr — ipsr) color-color diagram down to ipsp < 21.3. Detailed description of the 
color selection and redshift measurement is M gan (2012). Using this dataset, photometric redshifts can 
be re-calibrated in the CFHT-LS W3 field, thereby reducing biases in redshift estimates at z > 1. Measurement of 
the galaxy bias of these luminous blue galaxies will be presented in Comparat et al (2012b, in preparation). This 
dataset has been important in motivating the “extended-BOSS” project (PI, J.-P. Kneib), a survey that is proposed 


to begin in Fall 2014 as part of the successor to SDSS-III. The data will also be used to improve the targeting strategy 
of future projects such as BigBOSS (Schlegel et a14|2011]). 


Broad Absorption Line (BAL) Quasar Variability Survey: Thousands of BAL quasars were discovered in 
the SDSS-I and -II MA cima 2009). In some cases, repeat spectroscopy showed variable absorption, providing 
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clues to the nature of the BAL phenomenon (e.g., [Lundgren et al][2007) [2010). Returning with 
BOSS to obtain repeat spectra on a much larger sample of these quasars allows a large-scale study of BAL variability on 
multi-year timescales in the rest frame. The resulting data provide insight into the dynamics, structure, and energetics 
of quasar winds. First results from this ancillary project are presented in [Filiz Ak ct al] (2012). 

The targets for this ancillary project were selected before the decision was made to re-target known quasars at 
z > 2.15 (see (2.4), and thus there is some overlap between these two samples. However, this ancillary project also 
provides many unique targets at z « 2.15. The main sample of BAL quasars chosen for study contains 2005 objects 
assigned the ancillary target flag VARBAL; this sample is about two orders of magnitude larger than those used 
previously to investigate DAL variability on multi-year timescales. These 2005 objects were selected to be optically 
bright (ipse < 19.28 with no correction for extinction) and to have at least moderately strong absorption in one of 
their BAL troughs (with a “balnicity index" of BIo > 100 km s^! as measured by [Gibson et al][2009]. In addition, 
only quasars that are in redshift ranges such that strong BAL transitions are fully covered by the SDSS-I, SDSS-II 
and BOSS spectra (from outflow velocities of 0-25000 km s^!) were included; see §4 of for further 
explanation. The corresponding redshift ranges are 1.96 < z < 5.55 for Si IV BALs, 1.68 « z « 4.93 for C IV BALs 
1.23 « z « 3.93 for Al III BALs, and 0.48 < z < 2.28 for Mg II BALs. Finally, for those objects in the [Gibson et al] 
catalog that have measurements of the SNR at rest-frame 1700 À (SN1700), we require that SN1700 > 6; this 
criterion ensures that high-quality SDSS/SDSS-II spectra are available for these targets. 

In addition to the primary VARBAL sample objects described above, the BAL quasar variability survey also targets 
102 additional BAL quasars selected with other approaches. These targets may violate one or more of the selection 
criteria utilized for the VARBAL targets, but they have been identified as worthy of new observations nonetheless. 
'The relevant source types for these additional BAL quasars are the following: 


e LBQSBAL and FBQSBAL are BAL quasars identified in the Large Bright Quasar Survey (LBQS; e.g., 
oco and the FIRST Bright Quasar Survey (FBQS; e.g., Wa [2000), respectively. They 
thus have LBQS or FBQS spectra predating the SDSS-I and -II spectra by up to a decade or more. 


e OTBAL (Overlapping-Trough BAL quasars) are BAL quasars with nearly complete absorption at wavelengths 
shortward of Mg II in one epoch and which in one case have already shown extreme variability (e.g.,|Hall et al. 


2002). 


e PREVBAL are BAL quasars observed more than once by SDSS-I and -II. They thus already possess more than 
one observation epoch for comparison to BOSS spectra. 


e ODDBAL are BAL quasars selected to have various unusual properties (e.g., [Hall et a1/|2002). For these 
objects, variability (or the lack thereof) between SDSS-I and -II and BOSS may help to unravel the processes 
responsible for their unusual spectra. 


Variable Quasar Narrow-line Absorption: Quasar absorption lines are plentiful in SDSS I and II and have 
been documented in a catalog of all lines and systems (QSOALS: York et alll2008). This catalog (York et al. 2012, in 
prep.), now updated through DR7, contains 60,000 uniformly detected quasar absorption line systems in which two or 
more transitions from common metal absorption lines (e.g., Mg II, Fe IT, C IV) are identified at the same redshift. This 
dataset has been used to study the statistics of quasar absorption lines (York et al][2008) and to confirm correlations 
with quasars as well as foreground galaxies projected along the line of sight 

It has been shown that smaller equivalent width BALs are more prone to variation on short timescales (e.g., 
[Lundgren et al][2001). A large survey of variability in narrow absorption lines (NALs) is therefore required in 
order to examine if this trend applies across a larger range in equivalent widths. Complementary to the ancillary BAL 
quasar variability study described above, this program seeks to compile the largest dataset of multi-epoch observations 
of quasar sight lines with known narrow absorption along the line of sight. Detections of variability in NAL systems 
hold great promise for identifying high-velocity intrinsic quasar absorption and mini-BAL emergence and for providing 
limits on the sizes of cold gas clouds in the extended haloes of luminous galaxies in the foreground. 

The targets of this program include quasars with 16.5 < gy» < 17.9 and redshift 0.7 < z < 2.2 from the DR7 quasar 
catalog (Schneider et al/|2010), which would otherwise be ignored by the primary BOSS target selection. Sight lines 
with known BALs (Gibson et al] are ignored, as this parameter space is being covered by the separate BOSS 
BAL variability program. Each of the sight lines targeted in this program contains a NAL system detected at > 4c 
(including multiple unambiguous transitions of Mg II, C IV, or both), which have been identified in[York et al] (2005). 
However, this program is not limited only to cases in which previously identified NALs disappear in later epochs, since 
NALs should have the same probability of emerging along these lines of sight with and without identified NALs. In 
total, this program targets about 3000 quasars with a target density of ~ 0.35 deg ?. 

As one of the science goals of this program is to determine the extent of variable NALs in velocity space relative to 
the quasar, the target list includes sight lines with NALs over a wide range in velocity. The following sub-groups allow 
for the identification of quasars and absorbers with particular characteristics. 


e QSO_RADIO_AAL: radio-loud with 1 associated absorption system (AAL; v<5000 km s^! in the quasar rest 
frame). 
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e QSO. RADIO.IAL: radio-loud with 1 intervening absorption system (IAL: v>5000 km s^! in the quasar rest 
frame). 


e QSO. AAL: radio-quiet source with 1 AAL 

e QSO.IAL: radio-quiet source with 1 IAL 

e QSO. RADIO: radio-loud source with multiple AALs and/or IALs 
e QSO. A ALs: radio-quiet source with multiple AAL and/or IALs 

e QSO. noA ALs: radio-quiet source with no AALs and multiple IALs 


Double-lobed Radio Quasars: Objects identified as optical point sources near the midpoint of pairs of FIRST 
radio sources are observed as potential double-lobed radio quasars. Such quasars are important for studying quasar 
evolution and interactions of radio jets with their local environment. Candidates are selected by identifying FIRST 
pairs with a separation less than 60" and no SDSS optical counterpart within 2" of either source. SDSS point sources 
located within a search radius that ranges between 2" and 5.3" (depending on the separation distance of the FIRST 
pair) from the midpoint are targeted. FIRST pairs with a flux ratio > 10 are rejected because true double-lobed sources 
are unlikely to have a high ratio of lobe-lobe flux density. The final catalog includes objects not spectroscopically ob- 
served with SDSS, not targeted in the main BOSS sample, and with 17.8 < ipsp < 21.6 (Galactic extinction-corrected). 


High Redshift Quasars: High-redshift quasars trace the evolution of early generations of supermassive black 
holes, provide tests for models of quasar formation and AGN evolution, and probe evolution in the intergalactic 
medium (IGM). However, the BOSS quasar survey selects objects only to z ~ 3.5. Light emitted 
by high redshift quasars at wavelengths shorter than Lya is absorbed by the IGM, meaning that for redshifts z > 5.7, 
quasars are detected in only the z band, the reddest filter in the SDSS imaging survey. We use areas with overlap 
imaging, thereby reducing contamination from cosmic rays and improving the photometry, to select high redshift 
quasar candidates in three redshift ranges to fainter magnitudes than in the SDSS survey. 

For the main survey, PSF magnitudes for objects with multiple detections are extracted from the Neighbors table in 
the DR7 Catalog Archive Server, and the detections are coadded in each band. Target selection is performed on the 
coadded photometry. For objects within Stripe 82, PSF magnitudes were extracted from the coadded image catalogs 
described akaa (2011). These catalogs combine roughly 20 epochs instead of just two, and permit selection 
of objects at fainter magnitudes. The first part of the program targets objects with similar color cuts imposed in 
the SDSS quasar survey 2002). The SDSS quasar target selection defined two inclusion regions in 
gri and riz color space for targeting z > 3.6 and z > 4.5 quasars, respectively (Richards et a1][2002). 'The first two 
high-z quasar ancillary programs are straightforward extensions of these color selection criteria to fainter magnitudes, 
with limits of ipsp < 21.3 in the main survey regions and ipsp < 21.5 (compared to ipsp < 20.2 for SDSS). The 
ancillary target flag QSO_GRI is assigned to objects meeting the gri color criteria, and the flag Q8O. RIZ to those 
meeting the riz criteria. In both cases, the primary color cut follows a diagonal line in the respective color plane 
(e.g, r — i < A(g — r) 4- B for the riz cut), and the intercept of this line is shifted slightly upwards for brighter 
objects. T'his approach allows objects with similar brightness to SDSS quasars to be selected with more relaxed color 
criteria. This sample will provide a probe of the quasar luminosity function at high redshift and improve small-scale 
clustering measurements. A final class of targets (QSO_HIZ) in this program are candidates for quasars with 
redshifts between 5.6 and 6.5. At these redshifts, quasars have extremely red i — z colors; this program targets all 
objects with (i — z) > 1.6 and z < 20.8 and that have no detections in the other SDSS bands. 


High-redshift Quasars from SDSS and UKIDSS: The final ancillary program described here targets high 
redshift quasar candidates through a combination of color cuts combining SDSS ugriz PSF photometry and UKIDSS 
YJHK aperture photometry (1" radius apertures). The addition of IR photometry from UKIDSS provides leverage 
to separate quasars at z ~ 5.5 from the red end of the stellar locus — indeed, the SDSS quasar survey was limited to 
z < 5.4 by the strong overlap in ugriz colors of higher redshift quasars with red stars. Quasar candidates were selected 
by matching stellar objects from the SDSS DR7 UKIDSS DR3 databases. The initial sample is drawn from SDSS 
with the cuts r—i > 1.4, i — z > 0.5, and z < 20.2. Likely stars are rejected with the criteria (H — K)vega « 0.53 
or (J — K)vega < 1.3(Y — J)vega + 0.32, taking advantage of the fact that quasars are redder than M stars at 
longer wavelengths and bluer at shorter wavelengths. The remaining candidates are then prioritized based on izY JK 
colors. For Stripe 82, the color criteria used for prioritizing targets were slightly relaxed, owing to the coadded ugriz 
photometry available from [Annis et al] (2011), which greatly reduced the initial stellar contamination from the riz 
selection. Objects selected with the relaxed criteria on Stripe 82 are given the HIZQSOS2 target bit. 
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TABLE 9 


BOSS ANCILLARY PROGRAMS WITH ANCILLARY_TARGET1 FLAG 


Primary Program 


Transient Universe 
Transient Universe 
Transient Universe 
Transient Universe 
Transient Universe 
Transient Universe 

High Energy Blazars 

High Energy Blazars 

High Energy Blazars 

High Energy Blazars 
Remarkable X-ray Sources 
Remarkable X-ray Sources 
Remarkable X-ray Sources 
Remarkable X-ray Sources 
BAL Variability 

BAL Variability 

BAL Variability 

BAL Variability 

BAL Variability 

BAL Variability 

Bright Galaxies 

Variable Quasar Absorption 
Variable Quasar Absorption 
Variable Quasar Absorption 
Variable Quasar Absorption 
Variable Quasar Absorption 
Variable Quasar Absorption 
Variable Quasar Absorption 
High Redshift Quasars 
High Redshift Quasars 
High Redshift Quasars 
High Redshift Quasars 
High Redshift Quasars 
High Redshift Quasars 
Reddened Quasars 
Reddened Quasars 
Reddened Quasars 
Reddened Quasars 

SN Host Galaxies 

SN Host Galaxies 

SN Host Galaxies 

SN Host Galaxies 

SN Host Galaxies 

Low Mass Binary Stars 
White Dwarf Stars 

White Dwarf Stars 

Very Low Mass Stars 

Very Low Mass Stars 

Very Low Mass Stars 

Very Low Mass Stars 

Very Low Mass Stars 

Very Low Mass Stars 

Very Low Mass Stars 

Very Low Mass Stars 
Distant Halo Giants 
Distant Halo Giants 

High Energy Blazars 

High Energy Blazars 

High Energy Blazars 

High Energy Blazars 

High Energy Blazars 

High Energy Blazars 

Star Forming Radio Galaxies 
X-ray View of Star-Formation and Accretion 
Remarkable X-ray Sources 
Remarkable X-ray Sources 
Remarkable X-ray Sources 
Luminous Blue Galaxies 
Galaxies near Quasar Sight-Lines 
Transient Universe 


a Sample is taken from the Stripe 82 region. 


SubProgram 


AMC 
FLAREI 
FLARE2 

HPM 

LOW.MET 
VARS 
BLAZGVAR 
BLAZR 
BLAZGXR 
BLAZXRSAM 
XMMBRIGHT 
XMMGRIZ 
XMMHR 
XMMRED 
FBQSBAL 
LBQSBAL 
ODDBAL 
OTBAL 
PREVBAL 
VARBAL 
BRIGHTGAL 
QSO_AAL 
QSO_AALS 
QSO_IAL 
QSO_RADIO 


QSO.RADIO.AAL 


QSO_RADIO_IAL 
QSO_NOAALS 
QSO.GRI 
QSO.GRI 
QSO.HIZ 
QSO.HIZ 
QSO.RIZ 
QSO. RIZ 
RQSS_SF 
RQSS_SFC 
RQSS.STM 
RQSS.STMC 
SN_GALI 
SN_GAL2 
SN_GAL3 
SN.LOC 
SPEC.SN 
SPOKE 


WHITEDWARF.NEW 
WHITEDWARF.SDSS 


BRIGHTERL 
BRIGHTERL 
BRIGHTERM 
BRIGHTERM 
FAINTERL 
FAINTERL 
FAINTERM 
FAINTERM 
RED.KG 
RVTEST 
BLAZGRFLAT 
BLAZGRQSO 
BLAZGX 
BLAZGXQSO 
BLAZGXR 
BLAZXR 
BLUE.RADIO 
CHANDRAV1 
CXOBRIGHT 
CXOGRIZ 
CXORED 
ELG 
GAL.NEAR.QSO 
MTEMP 


Bit 
Number 


DBAADWISEWNrRO 


Ke) 


P Sample is taken from BOSS footprint that overlaps with DR7 imaging data. 


Density 
(deg?) 
0.05 


opopo 


= 
—-"oocoooecocooo 


Survey Area 
(deg?) 


220? 
220^ 
220 
220? 
220? 
220? 

7650 

7650P 

7650? 

7650P 

7650P 

7650 

7650 

7650 
5740 
5740 
5740 
5740 
5740 
5740 

7650 

76505 

7650 

7650P 

7650P 

7650 

7650 

7650° 
2202 
2500 
220 
2500 
220 
2500 
~ 50 
~ 50 
~ 50 
~ 50 
2202 
220? 
220? 
220? 
220? 

7650 

7430? 

7430? 


TABLE 10 
BOSS ANCILLARY PROGRAMS WITH ANCILLARY TARGET2 FLAG 


Primary Program SubProgram Bit Density Survey Area 
Number (deg~?) (deg?) 

High Redshift Quasars with UKIDSS HIZQSO82 0 0.5 220° 
High Redshift Quasars with UKIDSS HIZQSOIR 1 0.3 700 
K-Band Selected Quasars KQSO.BOSS 2 1.0 220? 
No Quasar Left Behind QSO.VAR 3 6.5 220* 
Variability Selected Quasars QSO.VAR.FPG 4 3.4 220% 
Double-lobed Radio Quasars RADIO_2LOBE_QSO 5 0.3 7650 
Brightest Cluster Galaxies STRIPE82BCG 6 6.0 220? 


